Repositories starred by yangrudan

MiniCPM5-1B: A SOTA 1B on-device LLM, small yet powerful.

Jupyter Notebook 9,441 621 Updated Jun 12, 2026

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

Python 28,799 3,256 Updated Jun 10, 2026

Animation engine for explanatory math videos

Python 87,566 7,309 Updated Apr 18, 2026

🌌 AI platform leveraging AI agents and ML models for exoplanet discovery, built for the NASA Space Apps Challenge 2025 (A World Away: Hunting for Exoplanets with AI)

TypeScript 9 1 Updated Jan 15, 2026

Open-source, local-first AI journal app for iOS and Android. Capture text, photos, and voice — AI agents organize them into timeline cards and insights. Your data stays on your device. Bring your o…

Dart 442 39 Updated Jun 13, 2026

HRM-Text is a 1B text generation model based on the HRM architecture, strengthened by task completion and latent space reasoning.

Python 1,280 118 Updated May 27, 2026

Browser automation CLI built for AI agents. Break through anti-bot walls, hand off to humans across platforms when stuck. Parallel multi-task execution, independent multi-session operation, isolate…

Python 2,448 98 Updated Jun 12, 2026

A 3B-active-parameter native unified multimodal model for image and video understanding, generation, and editing.

Python 1,197 79 Updated Jun 13, 2026

A collection of C++ programs that demonstrate common ways to detect the presence of an attached debugger.

C++ 622 83 Updated Dec 28, 2021

Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

Python 25,729 1,702 Updated Jun 13, 2026

Open-source unified multimodal model

Python 6,007 532 Updated May 4, 2026

A collection of skills for vllm-omni

Python 76 24 Updated Jun 8, 2026

FlashInfer: Kernel Library for LLM Serving

Python 5,786 1,046 Updated Jun 13, 2026
Python 607 59 Updated May 21, 2026

An open-source, extensible AI agent that goes beyond code suggestions: install, execute, edit, and test with any LLM

Rust 49,184 5,190 Updated Jun 13, 2026

Multimodal Orchestration for Artifacts — AI model lifecycle engine with 7-provider routing, circuit breaker, preflight prediction

Rust 3 Updated May 31, 2026

KASLD derandomises the Linux kernel's virtual and physical memory layout as an unprivileged local user.

C 493 52 Updated Jun 13, 2026

Extract and analyze environment variables from running Linux processes.

Rust 4 Updated Mar 22, 2026

A framework for efficient model inference with omni-modality models

Python 5,130 1,106 Updated Jun 13, 2026

Repair malformed JSON from LLMs, APIs, logs, and user input in Python.

Python 4,971 200 Updated Jun 9, 2026

An annotated copy of the nano_vllm repository, with MiniCPM4 adaptation and support for registering new models

Python 192 32 Updated Aug 11, 2025

Nano vLLM

Python 14,010 2,209 Updated Apr 26, 2026

A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

Python 5,151 546 Updated Jun 12, 2026
Python 157 19 Updated Mar 5, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,567 848 Updated Jun 12, 2026

SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models

Python 489 206 Updated Jun 13, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,386 698 Updated May 17, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 82,766 18,020 Updated Jun 13, 2026

Community maintained hardware plugin for vLLM on Ascend

C++ 2,237 1,388 Updated Jun 13, 2026