Stars
Graphs that teach > graphs that impress. Turn any code into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claude Code, Codex, Cursor, Copilot, Gemini C…
Interactive World Model papers organized by core research challenges.
FlashInfer: Kernel Library for LLM Serving
A paper list for spatial reasoning
The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".
A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.
A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.
An elegant Go board and SGF editor for a more civilized age.
Improve your Baduk skills by training with KataGo!
A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.
CLI tool for configuring and monitoring Claude Code
[TMLR 2025🔥] A survey for the autoregressive models in vision.
Auto get diffusion nlp papers in Axriv. More papers Information can be found in another repository "Diffusion-LM-Papers".
Awesome LLM pre-training resources, including data, frameworks, and methods.
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs
A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.
A PyTorch native platform for training generative AI models
Fully open reproduction of DeepSeek-R1
GPU programming related news and material links
React + Next.js template for research websites (for PhD students, researchers, etc)
A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.
Model Compression Toolbox for Large Language Models and Diffusion Models
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Puzzles for learning Triton
Official Implementation for "ESCAPE: Encoding Super-keypoints for Category-Agnostic Pose Estimation", CVPR 2024.
A curated list of recent diffusion models for video generation, editing, and various other applications.