Stars
🤗 ml-intern: an open-source ML engineer that reads papers, trains models, and ships ML models
Learning notes for understanding modern machine learning systems.
FMMS Kernel: Fused Matrix-Multiplication + Sampling
Autonomous experiment loop extension for pi
Use Garry Tan's exact Claude Code setup: 23 opinionated tools that serve as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA
My learning notes for ML SYS.
A minimal, extensible LLM inference engine built from scratch in ~950 lines.
SGLang is a high-performance serving framework for large language models and multimodal models.
Hundreds of models & providers. One command to find what runs on your hardware.
Nightshift uses your leftover Claude / Codex budget to surprise you with useful PRs. Love them or leave them.
MCP server for the X (Twitter) API -- give AI agents the ability to post, search, read, and engage on X
A PyTorch-native inference engine with cache, parallelism, quantization for Diffusion Transformers.
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
The most powerful local music generation model that outperforms almost all commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices.
FASHN VTON v1.5: Efficient Maskless Virtual Try-On in Pixel Space
MoE training for Me and You and maybe other people
Latent Collaboration in Multi-Agent Systems
A calm, CLI-native way to semantically grep everything, like code, images, pdfs and more.
[NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning
Mirage Persistent Kernel: Compiling LLMs into a MegaKernel