- All languages
- ActionScript
- Batchfile
- C
- C#
- C++
- CSS
- CoffeeScript
- Cuda
- Dart
- Dockerfile
- EJS
- FreeMarker
- GCC Machine Description
- Go
- Groovy
- HTML
- Haskell
- Java
- JavaScript
- Jsonnet
- Jupyter Notebook
- Kotlin
- Lua
- Makefile
- Markdown
- Nim
- Objective-C
- OpenSCAD
- PHP
- Perl
- PostScript
- Python
- Rich Text Format
- Ruby
- Rust
- SCSS
- Shell
- Swift
- TeX
- TypeScript
- Vim Script
- Vue
Starred repositories
Tile primitives for speedy kernels
An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation
Survey: A collection of AWESOME papers and resources on the large language model (LLM) related recommender system topics.
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…
PyTorch native quantization and sparsity for training and inference
Efficient Triton Kernels for LLM Training
Official code of "StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs".
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
<Foundations of Computer Vision> Book
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache).
Finetune VITS and MMS using HuggingFace's tools
Collection of leaked system prompts
[ICML 2025] Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization
🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"
What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
《Designing Data-Intensive Application》DDIA 第一版 / 第二版 中文翻译
[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/
Vim plugin for LLM-assisted code/text completion
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Machine Learning Engineering Open Book
🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch
Xiaomi Home Integration for Home Assistant