Stars
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
Boosting RAG model and system performance with context reuse
Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels
[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…
Chat log tool: easily use your own chat data
Zotero plugin to automatically move attachments and link them
⛷ Lightweight Markdown app to help you write great sentences.
Curated collection of papers in MoE model inference
Machine Learning Engineering Open Book
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
My learning notes for ML SYS.
Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.
[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression
The official implementation of "ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning"
AG-UI: the Agent-User Interaction Protocol. Bring Agents into Frontend Applications.
A sparse attention kernel supporting mix sparse patterns
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
A simple, cross-platform agent framework and tutorial
[ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring
[ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe that massive values are concentrated in low-frequency dimensions across different attentio…
Unified KV Cache Compression Methods for Auto-Regressive Models