ChristianWeyer

Organizations

@thinktecture


Docker configuration for running vLLM on dual DGX Sparks

Shell 42 13 Updated Dec 24, 2025

A C#/.NET library to run LLMs (🦙LLaMA/LLaVA) efficiently on your local device.

C# 3,471 480 Updated Dec 23, 2025

The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama.

TypeScript 18,980 1,365 Updated Dec 24, 2025

An open-source AI Voice Agent that integrates with Asterisk/FreePBX using Audiosocket/RTP technology

Python 184 43 Updated Dec 23, 2025

Efficient few-shot learning with Sentence Transformers

Jupyter Notebook 2,649 253 Updated Dec 11, 2025

Real-time Visualizer for Neural Networks

Python 5 1 Updated Nov 30, 2025

Collection of step-by-step playbooks for setting up AI/ML workloads on NVIDIA DGX Spark devices with Blackwell architecture.

TypeScript 271 85 Updated Dec 23, 2025

Unified Schema-Based Information Extraction

Python 397 41 Updated Dec 19, 2025

OGRAG - Release Version

Python 51 16 Updated Nov 11, 2025

Simple Graph RAG demo based on Jaguar data

Python 136 32 Updated Nov 23, 2025

🌐 The open-source Agentic browser; privacy-first alternative to ChatGPT Atlas, Perplexity Comet, Dia.

C++ 8,325 805 Updated Dec 24, 2025

Examples, end-2-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK

647 86 Updated Dec 23, 2025

Learn to build and deploy local Visual Language Models for Edge AI

Jupyter Notebook 333 39 Updated Oct 30, 2025

RamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all through the familiar language of …

Python 2,410 287 Updated Dec 22, 2025

Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama - but purpose-built and deeply optimized for the AMD NPUs.

C++ 560 24 Updated Dec 23, 2025

FastAPI + MLX offline-first voice agent with <1s latency. Minimal UI

Python 41 8 Updated Oct 21, 2025

This course is designed to guide beginners through the exciting world of Edge AI, covering fundamental concepts, popular models, inference techniques, device-specific applications, model optimizati…

Jupyter Notebook 1,223 230 Updated Dec 24, 2025

Awesome LLM speech-to-speech models and frameworks

29 3 Updated Nov 17, 2025

Your private, personal assistant

TypeScript 58 12 Updated Sep 30, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9,202 835 Updated Nov 20, 2025

Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

Python 2,667 272 Updated Nov 26, 2025

This prototype shows how to build a local Retrieval-Augmented Generation (RAG)

Python 21 5 Updated Sep 8, 2025

Make text LLMs listen and speak

Python 1,048 180 Updated Dec 23, 2025

CLI-based tester for verifying that MCP servers work correctly when called directly and by agents

TypeScript 33 2 Updated Sep 16, 2025

AMD Ryzen™ AI Software includes the tools and runtime libraries for optimizing and deploying AI inference on AMD Ryzen™ AI powered PCs.

Python 718 110 Updated Dec 16, 2025

LLM agents built for control. Designed for real-world use. Deployed in minutes.

Python 16,773 1,405 Updated Dec 24, 2025

Ollama-like CLI tool for MLX models on Hugging Face (pull, rm, list, show, serve, etc.)

Python 120 8 Updated Dec 23, 2025

Fast ML Inference Building Blocks Library in C#

C# 37 4 Updated Nov 18, 2025

A high-performance API server that provides OpenAI-compatible endpoints for MLX models. Developed using Python and powered by the FastAPI framework, it provides an efficient, scalable, and user-fri…

Python 161 32 Updated Dec 21, 2025

The official C# SDK for Model Context Protocol servers and clients. Maintained in collaboration with Microsoft.

C# 3,713 590 Updated Dec 22, 2025