Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal domains, for both inference and training.
Models and examples built with TensorFlow
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Real-time face swap for PC streaming or video calls
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
PyTorch implementations of Generative Adversarial Networks.
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Ongoing research training transformer models at scale
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
🍀 PyTorch implementations of various attention mechanisms, MLPs, re-parameterization, and convolution modules, helpful for understanding the papers. ⭐⭐⭐
Style transfer, deep learning, feature transform
🐍 Geometric Computer Vision Library for Spatial AI
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activelo…
PyTorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
Real-Time High-Resolution Background Matting
OpenMMLab Pose Estimation Toolbox and Benchmark.
Synthesizing and manipulating 2048x1024 images with conditional GANs
Official PyTorch implementation of StyleGAN3
SwinIR: Image Restoration Using Swin Transformer (official repository)
StyleGAN2-ADA - Official PyTorch implementation
Unofficial PyTorch implementation of Image Super-Resolution via Iterative Refinement
Scenic: A JAX Library for Computer Vision Research and Beyond
StarGAN v2 - Official PyTorch Implementation (CVPR 2020)