Stars
Unified Controllable Visual Generation Model
Pytorch implementation for Controllable Text-to-Image Generation.
[CVPR 2021] Pytorch implementation for TediGAN: Text-Guided Diverse Face Image Generation and Manipulation
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Code and Dataset for FS-COCO: Towards Understanding of Freehand Sketches of Common Objects in Context.
USketch: A crowdsourcing tool for SketchyScenes
[ECCV 2018] SketchyScene: Richly-Annotated Scene Sketches
[ICCV 2025] CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation
PyTorch implementation of RCG https://arxiv.org/abs/2312.03701
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer
Code for paper "SketchyCOCO: Image Generation from Freehand Scene Sketches" (CVPR 2020)
This project is a reimplementation of The Sketchy Database: Learning to Retrieve Badly Drawn Bunnies
[ICML 2023 Oral, NeurIPS 2023] Official implementations for paper: Customizable Image Synthesis with Multiple Subjects
[ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
分享 GitHub 上有趣、入门级的开源项目。Share interesting, entry-level open source projects on GitHub.
Python - 100天从新手到大师
[ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generate using MULTI LLMs, without GPU needed. The user can ask a question and the system will make a mu…
A latent text-to-image diffusion model
[AAAI 2025] MultiBooth: This repo is the official implementation of "MultiBooth: Towards Generating All Your Concepts in an Image from Text"
MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models (NeurIPS 2024)
We propose VidSketch, the first method capable of generating high-quality video animations directly from any number of hand-drawn sketches and simple text prompts, bridging the divide between ordin…