Your Personal AI Assistant; easy to install, deploy on your own machine or on the cloud; supports multiple chat apps with easily extensible capabilities.

Python 15,104 2,036 Updated Apr 12, 2026

lightseekorg / smg-dev-guide

Shepherd Model Gateway Claude Code Skills

7 1 Updated Mar 9, 2026

sgl-project / sglang-omni

SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models

Python 182 74 Updated Apr 11, 2026

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,079 667 Updated Apr 12, 2026

v6d-io / v6d

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

C++ 949 131 Updated Jan 22, 2026

lightseekorg / smg

Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across SGLang, vLLM, TRT-LLM, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat histor…

Rust 159 43 Updated Apr 12, 2026

ulab-uiuc / LLMRouter

LLMRouter: An Open-Source Library for LLM Routing

Python 1,634 151 Updated Mar 17, 2026

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,967 566 Updated Mar 13, 2026

DayuanJiang / next-ai-draw-io

A next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…

TypeScript 26,926 2,839 Updated Apr 12, 2026

openai / openai-realtime-console

React app for inspecting, building and debugging with the Realtime API

JavaScript 3,573 1,413 Updated Aug 28, 2025

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 8,763 762 Updated Mar 26, 2026

vllm-project / vllm-omni

A framework for efficient model inference with omni-modality models

Python 4,245 735 Updated Apr 12, 2026

alibaba / InferSim

A Lightweight LLM Inference Performance Simulator

Python 67 19 Updated Mar 18, 2026

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 22,811 2,098 Updated Jan 27, 2026

bytecodealliance / wamr-rust-sdk

Rust 69 30 Updated Apr 6, 2026

sgl-project / sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,690 5,301 Updated Apr 12, 2026

yjs / yjs

Shared data types for building collaborative software

JavaScript 21,620 762 Updated Apr 11, 2026

bytecodealliance / wasm-micro-runtime

WebAssembly Micro Runtime (WAMR)

C 5,885 785 Updated Apr 8, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tony Lu tonyluj

Achievements

Achievements

Block or report tonyluj

Starred repositories

MemPalace / mempalace

vllm-project / router

alibaba / rtp-llm

openclaw / openclaw

ghostty-org / ghostty

langchain-ai / langchain

envoyproxy / ai-gateway

iDvel / rime-ice

wezterm / wezterm

VoltAgent / awesome-claude-code-subagents

AlexxIT / go2rtc

nearai / ironclaw

agentscope-ai / QwenPaw