Stars
🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.
Official PyTorch implementation of PF-RPN (CVPR 2026).
Elevate your AI research writing, no more tedious polishing ✨
CADAM is the open-source text-to-CAD web application
The GitHub repo for our survey paper: "Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models"
LLaDA2.0 is the diffusion language model series developed by the InclusionAI team, Ant Group.
Official implementation of "Continuous Autoregressive Language Models"
Causal video-action world model for generalist robot control
Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation"
[CVPR 2026] 3D Motion Reconstruction for 4D Synthesis
[ICLR 2026] Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding"
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…
Official codebase for the SIGGRAPH Asia 2025 paper AutoBrep: Autoregressive B-Rep Generation with Unified Topology and Geometry
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
This is the official implementation for the paper "On Powerful Ways to Generate: Autoregression, Diffusion, and Beyond".
Echos is a headless, API-driven DAW engine. It’s the backend for building AI tools that automate the entire music production lifecycle.
Native Multimodal Models are World Learners
🚀 [Large Models] Train a 67M-parameter vision-language model (VLM) from scratch in just 1 hour! 🌏
The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation