Lists (4)
Sort Name ascending (A-Z)
Starred repositories
MiniCPM5-1B: A SOTA 1B on-device LLM, small yet powerful.
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
🌌 AI Platform leveraging AI agents & ML models for exoplanet discovery - Nasa Space App Challenge 2025 (A World Away: Hunting for Exoplanets with AI)
Open-source, local-first AI journal app for iOS and Android. Capture text, photos, and voice — AI agents organize them into timeline cards and insights. Your data stays on your device. Bring your o…
HRM-Text is a 1B text generation model based on the HRM architecture, strengthened by task completion and latent space reasoning.
Browser automation CLI built for AI agents. Break through anti-bot walls, hand off to humans across platforms when stuck. Parallel multi-task execution, independent multi-session operation, isolate…
A 3B-active-parameter native unified multimodal model for image and video understanding, generation, and editing.
A collection of c++ programs that demonstrate common ways to detect the presence of an attached debugger.
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.
a collection of skills for vllm-omni
FlashInfer: Kernel Library for LLM Serving
an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM
Multimodal Orchestration for Artifacts — AI model lifecycle engine with 7-provider routing, circuit breaker, preflight prediction
KASLD derandomises the Linux kernel's virtual and physical memory layout as an unprivileged local user.
Extract and analyze environment variables from running Linux processes.
A framework for efficient model inference with omni-modality models
Repair malformed JSON from LLMs, APIs, logs, and user input in Python.
注释的nano_vllm仓库,并且完成了MiniCPM4的适配以及注册新模型的功能
A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
A high-throughput and memory-efficient inference and serving engine for LLMs
Community maintained hardware plugin for vLLM on Ascend