Stars
[TMLR 2025] Efficient Diffusion Models: A Survey
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
[NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
About This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo]
Reading list for research topics in multimodal machine learning
A collection of resources on controllable generation with text-to-image diffusion models.
collection of diffusion model papers categorized by their subareas
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
A collection of resources and papers on Diffusion Models
Efficient vision foundation models for high-resolution generation and perception.
Diffusion model papers, survey, and taxonomy
High-Resolution Image Synthesis with Latent Diffusion Models
Learn OpenCV : C++ and Python Examples
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
This may be the simplest implement of DDPM. You can directly run Main.py to train the UNet on CIFAR-10 dataset and see the amazing process of denoising.
A latent text-to-image diffusion model
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
A collection of awesome text-to-image generation studies.
The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)