Highlights
- Pro
Stars
A suite of image and video neural tokenizers
Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen
Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
Official Implementation of "Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models"
Ablating Concepts in Text-to-Image Diffusion Models (ICCV 2023)
Official implementation for Estimating the Optimal Covariance with Imperfect Mean in Diffusion Probabilistic Models (ICML 2022), and a reimplementation of Analytic-DPM: an Analytic Estimate of the …
Official code for "Maximum Likelihood Training of Score-Based Diffusion Models", NeurIPS 2021 (spotlight)
Conditional diffusion model to generate MNIST. Minimal script. Based on 'Classifier-Free Diffusion Guidance'.
Implement a MNIST(also minimal) version of denoising diffusion probabilistic model from scratch.The model only has 4.55MB.
Easily compute clip embeddings and build a clip retrieval system with them
Using Low-rank adaptation to quickly fine-tune diffusion models.
Unofficial PyTorch implementation of Denoising Diffusion Probabilistic Models
Unofficial PyTorch Implementation of Denoising Diffusion Probabilistic Models (DDPM)
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
PyTorch reimplementation of Diffusion Models
Recent Transformer-based CV and related works.
MIT EECS Thesis Proposal Template
ManimML is a project focused on providing animations and visualizations of common machine learning concepts with the Manim Community Library.
Code for Debiasing Vision-Language Models via Biased Prompts
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
An open source implementation of CLIP.
Pytorch implementation for t-SNE with cuda to accelerate
[ICML2023] InfoOT: Information Maximizing Optimal Transport
NeurIPS 2022: Tree Mover’s Distance: Bridging Graph Metrics and Stability of Graph Neural Networks
A latent text-to-image diffusion model
Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings