Stars
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
Builder and index for PyTorch packages
VITRA: Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos
A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.
This repository contains the code of the paper "IC-World: In-Context Generation for Shared World Modeling".
A toolbox for spectral compressive imaging reconstruction including MST (CVPR 2022), CST (ECCV 2022), DAUHST (NeurIPS 2022), BiSCI (NeurIPS 2023), HDNet (CVPR 2022), MST++ (CVPRW 2022), etc.
The repository for CVPR 2022 Paper "Neural 3D Video Synthesis"
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Code of π^3: Permutation-Equivariant Visual Geometry Learning
An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation
A unified inference and post-training framework for accelerated video generation.
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
Open3D: A Modern Library for 3D Data Processing
4-steps distilled version of Wan2.2-TI2V-5B
GoatWu / Self-Forcing-Plus
Forked from guandeh17/Self-ForcingUnofficial extension implementation of Self-Forcing to support I2V && 14B training.
The most widely used, high performance Minecraft server that aims to fix gameplay and mechanics inconsistencies
Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs
djbrown79 / Voyager
Forked from MineDojo/VoyagerAn Open-Ended Embodied Agent with Large Language Models
An Open-Ended Embodied Agent with Large Language Models
Minecraft AI with LLMs+Mineflayer
MineRL Competition for Sample Efficient Reinforcement Learning - Python Package
Building Open-Ended Embodied Agents with Internet-Scale Knowledge