yangrudan

Follow

🎯

Focusing

Cookie yangrudan

🎯

Focusing

Follow

Yep

18 followers · 27 following

Achievements

Achievements

Lists (4)

Sort

🐒GPU

11 repositories

🗡️hardware

🎭 Leaf

🚀 My stack

Starred repositories

OpenBMB / MiniCPM

MiniCPM5-1B: A SOTA 1B on-device LLM, small yet powerful.

Jupyter Notebook 9,441 621 Updated Jun 12, 2026

OpenBMB / VoxCPM

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

Python 28,799 3,256 Updated Jun 10, 2026

3b1b / manim

Animation engine for explanatory math videos

Python 87,566 7,309 Updated Apr 18, 2026

Paulhb7 / the-astronomists-nasa-space-app-challenge-2025

🌌 AI Platform leveraging AI agents & ML models for exoplanet discovery - Nasa Space App Challenge 2025 (A World Away: Hunting for Exoplanets with AI)

TypeScript 9 1 Updated Jan 15, 2026

memex-lab / memex

Open-source, local-first AI journal app for iOS and Android. Capture text, photos, and voice — AI agents organize them into timeline cards and insights. Your data stays on your device. Bring your o…

Dart 442 39 Updated Jun 13, 2026

sapientinc / HRM-Text

HRM-Text is a 1B text generation model based on the HRM architecture, strengthened by task completion and latent space reasoning.

Python 1,280 118 Updated May 27, 2026

browser-act / skills

Browser automation CLI built for AI agents. Break through anti-bot walls, hand off to humans across platforms when stuck. Parallel multi-task execution, independent multi-session operation, isolate…

Python 2,448 98 Updated Jun 12, 2026

bytedance / Lance

A 3B-active-parameter native unified multimodal model for image and video understanding, generation, and editing.

Python 1,197 79 Updated Jun 13, 2026

ThomasThelen / Anti-Debugging

A collection of c++ programs that demonstrate common ways to detect the presence of an attached debugger.

C++ 622 83 Updated Dec 28, 2021

chopratejas / headroom

Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

Python 25,729 1,702 Updated Jun 13, 2026

ByteDance-Seed / Bagel

Open-source unified multimodal model

Python 6,007 532 Updated May 4, 2026

hsliuustc0106 / vllm-omni-skills

a collection of skills for vllm-omni

Python 76 24 Updated Jun 8, 2026

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Python 5,786 1,046 Updated Jun 13, 2026

DLYuanGod / MegaTrain

Python 607 59 Updated May 21, 2026

mit-han-lab / mlsys2026-flashinfer-contest

Python 82 3 Updated May 24, 2026

aaif-goose / goose

an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM

Rust 49,184 5,190 Updated Jun 13, 2026

mofa-org / mofa-engine

Multimodal Orchestration for Artifacts — AI model lifecycle engine with 7-provider routing, circuit breaker, preflight prediction

Rust 3 Updated May 31, 2026

bcoles / kasld

KASLD derandomises the Linux kernel's virtual and physical memory layout as an unprivileged local user.

C 493 52 Updated Jun 13, 2026

bcoles / envex

Extract and analyze environment variables from running Linux processes.

Rust 4 Updated Mar 22, 2026

vllm-project / vllm-omni

A framework for efficient model inference with omni-modality models

Python 5,130 1,106 Updated Jun 13, 2026

mangiucugna / json_repair

Repair malformed JSON from LLMs, APIs, logs, and user input in Python.

Python 4,971 200 Updated Jun 9, 2026

LDLINGLINGLING / nano_vllm_note

注释的nano_vllm仓库，并且完成了MiniCPM4的适配以及注册新模型的功能

Python 192 32 Updated Aug 11, 2025

GeeeekExplorer / nano-vllm

Nano vLLM

Python 14,010 2,209 Updated Apr 26, 2026

gpustack / gpustack

A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

Python 5,151 546 Updated Jun 12, 2026

gogongxt / nano-sglang

Python 157 19 Updated Mar 5, 2026

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,567 848 Updated Jun 12, 2026

sgl-project / sglang-omni

SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models

Python 489 206 Updated Jun 13, 2026

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,386 698 Updated May 17, 2026

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 82,766 18,020 Updated Jun 13, 2026

vllm-project / vllm-ascend

Community maintained hardware plugin for vLLM on Ascend

C++ 2,237 1,388 Updated Jun 13, 2026

Starred topics

Python

Awesome Lists

Git

Emacs

Deep learning

Code review

Chrome