Stars
A latent text-to-image diffusion model
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
This repository contains the source code for the paper First Order Motion Model for Image Animation
High-Resolution Image Synthesis with Latent Diffusion Models
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
Taming Transformers for High-Resolution Image Synthesis
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
A high performance implementation of HDBSCAN clustering.
Kandinsky 2 — multilingual text2image latent diffusion model
Easily compute clip embeddings and build a clip retrieval system with them
Discovering Interpretable GAN Controls [NeurIPS 2020]
Generate images from texts. In Russian
Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion
ktrain is a Python library that makes deep learning and AI more accessible and easier to apply
Tools to train a generative model on arbitrary audio samples
Graph Transformer Networks (Authors' PyTorch implementation for the NeurIPS 19 paper)
yzhou359 / MakeItTalk
Forked from adobe-research/MakeItTalk
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
[ICCV 2021 Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-…
Official Implementation of Paella https://arxiv.org/abs/2211.07292v2
Reference code for "Motion-supervised Co-Part Segmentation" paper
Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]
A repo for the maintenance of the Colab version of stable-diffusion-webui repo