Stars
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Stable Diffusion with Core ML on Apple Silicon
Wan: Open and Advanced Large-Scale Video Generative Models
HunyuanVideo: A Systematic Framework For Large Video Generation Model
SwinIR: Image Restoration Using Swin Transformer (official repository)
Google Drive Public File Downloader when Curl/Wget Fails
[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing
Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.
Using modified BiSeNet for face parsing in PyTorch
A sketch extractor for anime/illustration.
Designing a Practical Degradation Model for Deep Blind Image Super-Resolution (ICCV, 2021) (PyTorch) - We released the training code!
[CVPR 2022 Oral] Official repository for "MAXIM: Multi-Axis MLP for Image Processing". SOTA for denoising, deblurring, deraining, dehazing, and enhancement.
[IJCV 2022] Bridging Composite and Real: Towards End-to-end Deep Image Matting
Learning Image-adaptive 3D Lookup Tables for High Performance Photo Enhancement in Real-time
A Fast Deep Learning Model to Upsample Low Resolution Videos to High Resolution at 30fps
An official implementation of MobileStyleGAN in PyTorch
This repo contains code and a pre-trained model for clothes segmentation.
[CVPR2024] SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
🌕 [BMVC 2022] You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction. SOTA for low light enhancement, 0.004 seconds try this for pre-p…
Official PyTorch implementation of "BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation" (NeurIPS 2021)
Official Code for ICCV 2021 paper "Towards Flexible Blind JPEG Artifacts Removal (FBCNN)"
All my self trained & released AI upscaling models. After gathering and applying over 600 different upscaling models, I learned how to train my own models, and these are the results.
Official implementation of the paper 'High-Resolution Photorealistic Image Translation in Real-Time: A Laplacian Pyramid Translation Network' in CVPR 2021
High-Resolution Image/Video Harmonization [ECCV 2022]
[ICLR 2025] HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
Imporved performance on facial image cartoonizaiton
PyTorch implementation of Accelerating the Super-Resolution Convolutional Neural Network (ECCV 2016)