Stars
[ACM CSUR 2025] Understanding World or Predicting Future? A Comprehensive Survey of World Models
[ECCV 2024] ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback.
[arXiv 2026] Official PyTorch Repository for "Coarse-Guided Visual Generation via Weighted h-Transform Sampling"
[arXiv 2026] Bi-Anchor Interpolation Solver for Accelerating Generative Modeling; Paper link: https://arxiv.org/abs/2601.21542
⚡ Dynamically generated stats for your github readmes
Official PyTorch Implementation for "Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising"
[CVPR 2026] Long-horizon, spatially consistent video generation enabled by persistent 3D scene point clouds and dynamic-static disentanglement.
[NeurIPS 2025] The PyTorch Implementation of NoOp
[CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
[ICLR 2026] Official implementation of Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling
Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
Official PyTorch implementation of the paper "FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing"
Official PyTorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" (ICLR 2024)
[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition & Understanding and General Relation Comprehension of the Open World
PyTorch Implementation of FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing (ICLR 2024)
[NeurIPS 2025] Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation
Wan: Open and Advanced Large-Scale Video Generative Models
Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"
[CVPR 2025] IterIS: Iterative Inference-Solving Alignment for LoRA Merging
[arXiv 2026] The PyTorch Implementation of "Target-aware Image Editing via Cycle-consistent Constraints"
[ICLR 2026] Official implementation of the paper "Exploring Cross-Modal Flows for Few-Shot Learning".
Official implementation of AnimateDiff.
[ICLR 2026] GIR-Bench: Versatile Benchmark for Generating Images with Reasoning
A list of works on video generation towards world models