Stars
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Machine Learning Engineering Open Book
Hand-written implementations of all the algorithms in Li Hang's book "Statistical Learning Methods"
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
🚀 [Large Models] Train a 67M-parameter vision multimodal VLM from scratch in just 1 hour! 🌏
Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.
A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3, and more.
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
Official repository for BrickGPT, the first approach for generating physically stable toy brick models from text prompts.
Native Multimodal Models are World Learners
[ECCV 2022] Tensorial Radiance Fields, a novel approach to model and reconstruct radiance fields
[ECCV 2024 Best Paper Candidate & TPAMI 2025] PointLLM: Empowering Large Language Models to Understand Point Clouds
Causal video-action world model for generalist robot control
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
Official implementation of "Continuous Autoregressive Language Models"
Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation"
Implementation of rectified flow and some of its follow-up research / improvements in PyTorch
High-quality and editable surfel 3D Gaussian generation through native 3D diffusion (ICLR 2025)
[arXiv] Discrete Diffusion in Large Language and Multimodal Models: A Survey
[CVPR 2023 Highlight] Implementation for Panoptic Lifting
[ICLR 2025] EdgeRunner: Auto-regressive Auto-encoder for Efficient Mesh Generation
[CVPR 2025] "Towards Universal Soccer Video Understanding".
[ICML 2025] Gaussian Mixture Flow Matching Models (GMFlow)