Stars
Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding"
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…
Official codebase for the Siggraph Asia 2025 paper AutoBrep: Autoregressive B-Rep Generation with Unified Topology and Geometry
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
This is the official implementation for paper "On Powerful Ways to Generate: Autoregression, Diffusion, and Beyond".
Echos is a headless, API-driven DAW engine. It’s the backend for building AI tools that automate the entire music production lifecycle.
Native Multimodal Models are World Learners
🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!
The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
[3DV 2026] Official implementation of FollowMyHold
[Arxiv] Discrete Diffusion in Large Language and Multimodal Models: A Survey
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
A collection of papers on discrete diffusion models
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
[ICCV 2025] Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis
A curated list of awesome papers for reconstructing 4D spatial intelligence from video. (arXiv 2507.21045)
A curated list of awesome Neural Computer-Aided Design (CAD) papers.
MeshArt: Generating Articulated Meshes with Structure-Guided Transformers (CVPR2025)