Stars
Official inference repo for FLUX.1 models
A simple package to upload DL checkpoints to remote storage
Official PyTorch Implementation of Our CVPR 2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization"
Easily create large video datasets from video URLs
DataComp: In search of the next generation of multimodal datasets
Easily compute CLIP embeddings from video frames
Official repo for consistency models.
[ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis
Official codebase for the Paper “Retrieval-Augmented Diffusion Models”
Making large AI models cheaper, faster and more accessible
Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attention Does Not Need O(n²) Memory"
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Robust Speech Recognition via Large-Scale Weak Supervision
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
VQVAEs, GumbelSoftmaxes and friends
Karras et al. (2022) diffusion models for PyTorch
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net from DeepMind, in PyTorch
Easily compute CLIP embeddings and build a CLIP retrieval system with them
Easily turn large sets of image URLs into an image dataset. Can download, resize and package 100M URLs in 20h on one machine.
Approximate nearest neighbor search with product quantization on GPU in PyTorch and CUDA
[CVPR 2022] StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2
Comparison between the Fréchet Video Distance implementations from StyleGAN-V and the original paper