Stars
Official inference repo for FLUX.1 models
Train transformer language models with reinforcement learning.
Lets make video diffusion practical!
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
🐧 在 Linux 上提供一套完整的 Clash / Mihomo(Clash Meta) 代理与管理面板
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
MAGI-1: Autoregressive Video Generation at Scale
[ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds
[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!
Official repository of In-Context LoRA for Diffusion Transformers
An intuitive GUI for GLIGEN that uses ComfyUI in the backend
📹 A more flexible framework that can generate videos at any resolution and creates videos from images.
Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"(ICCV2025)
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
[ICCV 2025] STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
ComfyUI nodes to use segment-anything-2
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
[ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey"
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
A Collection of Papers and Codes for CVPR2025/ICCV2025/CVPR2024/ECCV2024 AIGC
[ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥
[CVPR 2024 Highlight] MIGC and [TPAMI 2024] MIGC++ (Official Implementation)
[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"
Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"
[ICCV 2025] Official pytorch implementation of "FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors"
[CVPRW oral 2022] MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment