Stars
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
This is a collection of recent papers on reasoning in video generation models.
A Unified Visual Generator with Interleaved OmniModal Context
A curated list of recent diffusion models for video generation, editing, and various other applications.
[ICLR2026] Implementation of "S^2-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models"
An inference-time, plug-and-play method for temporal control in multi-event generation
🔥An open-source survey of the latest video reasoning tasks, paradigms, and benchmarks.
Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give it a star 🌟 if you find it useful.
[CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation
[CVPR 2026 Highlight] VideoCoF: Unified Video Editing with Temporal Reasoner
📖 This is a repository for organizing papers, codes, and other resources related to personalized video generation and editing.
Official repository of In-Context LoRA for Diffusion Transformers
DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation
[CVPR 2026, CVEU] Official codebase for FOCUS: Optimal Control for Multi-Entity World Modeling in Text-to-Image Generation
Video Diffusion Transformers are In-Context Learners
A collection of resources on personalized image generation.
[ICCV 2023] "TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition" (Official Implementation)
[ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
[🚀ICML 2025] "Taming Rectified Flow for Inversion and Editing" Using FLUX and HunyuanVideo for image and video editing!
The repository for the paper "Image Inversion: A Survey from GANs to Diffusion and Beyond".
official implementation of VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning (COLM 2024)
[ICLR 2025] Trajectory Attention For Fine-grained Video Motion Control
[WACV 2024] Training-Free Layout Control with Cross-Attention Guidance