-
Internal-Guidance Public
Forked from CVL-UESTC/Internal-GuidanceGuiding a Diffusion Transformer with the Internal Dynamics of Itself (IG)
Python MIT License UpdatedJan 2, 2026 -
DeCo Public
Forked from Zehong-Ma/DeCoOfficial repository for “DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation”
Python UpdatedNov 26, 2025 -
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
Python MIT License UpdatedNov 18, 2025 -
-
Awesome-Controllable-Video-Generation Public
Forked from mayuelala/Awesome-Controllable-Video-Generation[ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey"
UpdatedNov 11, 2025 -
-
-
-
Scene-Splatter Public
Forked from shengjun-zhang/Scene-Splatter[CVPR 2025] Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model
C++ MIT License UpdatedJun 26, 2025 -
DiC Public
Forked from YuchuanTian/DiC[CVPR 2025] "DiC: Rethinking Conv3x3 Designs in Diffusion Models", a performant & speedy Conv3x3 diffusion model.
Python Other UpdatedJun 12, 2025 -
-
-
fid-metrics Public
Forked from npurson/fid-metricsA toolkit for computing Fréchet Inception Distance (FID) & Fréchet Video Distance (FVD) metrics.
Python MIT License UpdatedApr 6, 2025 -
CamI2V Public
Forked from ZGCTroy/CamI2Vofficial repo of paper for "CamI2V: Camera-Controlled Image-to-Video Diffusion Model"
Python MIT License UpdatedFeb 17, 2025 -
Infinity Public
Forked from FoundationVision/InfinityInfinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
Python MIT License UpdatedDec 25, 2024 -
VAR Public
Forked from FoundationVision/VAR[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…
Jupyter Notebook MIT License UpdatedDec 22, 2024 -
minimind Public
Forked from jingyaogong/minimind「大模型」3小时完全从0训练26M的小参数GPT,个人显卡即可推理训练!
Python Apache License 2.0 UpdatedDec 13, 2024 -
RAR Public
Forked from bytedance/1d-tokenizerThis repo contains the code for 1D tokenizer and generator
Jupyter Notebook Apache License 2.0 UpdatedNov 20, 2024 -
ViewCrafter Public
Forked from Drexubery/ViewCrafterOfficial implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
Python Apache License 2.0 UpdatedNov 6, 2024 -
hart Public
Forked from mit-han-lab/hartHART: Efficient Visual Generation with Hybrid Autoregressive Transformer
Python MIT License UpdatedOct 16, 2024 -
DnD-Transformer Public
Forked from chenllliang/DnD-TransformerSource code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation"
Python UpdatedOct 15, 2024 -
-
diff-transformer Public
Forked from microsoft/unilmLarge-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Python MIT License UpdatedOct 10, 2024 -
U-DiT Public
Forked from YuchuanTian/U-DiT[NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"
Python Other UpdatedSep 30, 2024 -
-
DiTFastAttn Public
Forked from thu-nics/DiTFastAttnJupyter Notebook MIT License UpdatedSep 22, 2024 -
VideoLDM Public
Forked from Stability-AI/generative-modelsGenerative Models by Stability AI
Python MIT License UpdatedSep 4, 2024 -
EpiDiff Public
Forked from huanngzh/EpiDiff[CVPR 2024] EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion
Python MIT License UpdatedAug 30, 2024 -
Open-Sora Public
Forked from hpcaitech/Open-SoraOpen-Sora: Democratizing Efficient Video Production for All
Python Apache License 2.0 UpdatedAug 9, 2024 -
MotionClone Public
Forked from LPengYang/MotionCloneOfficial implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation
Python UpdatedAug 7, 2024