SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Updated Nov 4, 2025 - Python
LTX-Video Support for ComfyUI
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
[CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
[AAAI 2025] 👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and garment consistency for virtual dressing.
Rich-Text-to-Image Generation
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
(Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators
[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
[CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
Official repository for "CFG++: manifold-constrained classifier free guidance for diffusion models" (ICLR 2025)
Faster generation with text-to-image diffusion models.
[NeurIPS-2023] Annual Conference on Neural Information Processing Systems
🔥 [CVPR 2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models" (SLD)
[NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
[NeurIPS 2024] Official implementation of "BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion Models".
Official PyTorch repo of CVPR'23 and NeurIPS'23 papers on understanding replication in diffusion models.
🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"
[IJCAI 2025 (Oral)] Official implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models".