Skip to content
View FeiGSSS's full-sized avatar
😄
I may be slow to respond.
😄
I may be slow to respond.
  • Beijing
  • 22:33 (UTC -12:00)

Block or report FeiGSSS

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
490 results for source starred repositories
Clear filter
Python 36 2 Updated Feb 5, 2026

分享AI Infra知识&代码练习:PyTorch/vLLM/SGLang框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等

Jupyter Notebook 329 30 Updated Feb 7, 2026

Persist and reuse KV Cache to speedup your LLM.

Python 249 61 Updated Feb 6, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,344 414 Updated Feb 6, 2026

Boosting RAG on model and system performance with context reuse

Python 19 1 Updated Jan 28, 2026
Python 36 3 Updated Oct 16, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 5,078 438 Updated Feb 7, 2026

Contexts Optical Compression

Python 22,409 2,059 Updated Jan 27, 2026

[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Python 524 38 Updated Feb 10, 2025

Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…

Python 10,841 1,245 Updated Feb 6, 2026

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…

3,669 362 Updated Jul 25, 2025

The best ChatGPT that $100 can buy.

Python 42,451 5,482 Updated Feb 6, 2026

chat log tool, easily use your own chat data. 聊天记录工具,轻松使用自己的聊天数据

9,184 2,635 Updated Oct 20, 2025

Nano vLLM

Python 11,542 1,535 Updated Nov 3, 2025

Minimalist vLLM implementation in Rust

Rust 112 17 Updated Feb 6, 2026

Zotero plugin to automatically move attachments and link them

JavaScript 1,190 27 Updated Jan 28, 2026

⛷ Lightweight Markdown app to help you write great sentences.

Swift 7,498 441 Updated Feb 7, 2026

😼 优雅地使用基于 clash/mihomo 的代理环境

Shell 8,822 1,054 Updated Jan 29, 2026

Curated collection of papers in MoE model inference

341 12 Updated Oct 20, 2025

Machine Learning Engineering Open Book

Python 16,586 1,034 Updated Jan 23, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,741 2,032 Updated Jan 13, 2026

My learning notes for ML SYS.

Python 5,290 342 Updated Jan 30, 2026

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team.

Python 15,361 1,070 Updated Feb 3, 2026

[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression

Python 132 5 Updated Apr 12, 2025

The official implementation of "ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning"

Python 358 44 Updated Jan 16, 2026

AG-UI: the Agent-User Interaction Protocol. Bring Agents into Frontend Applications.

Python 11,852 1,086 Updated Feb 7, 2026

A sparse attention kernel supporting mix sparse patterns

C++ 453 45 Updated Jan 18, 2026

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 16,483 1,210 Updated Feb 7, 2026

A simple and trans-platform agent framework and tutorial

Jupyter Notebook 199 41 Updated Jan 17, 2026
Next