Stars
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Beautiful charts for iOS/tvOS/OSX! The Apple side of the cross-platform MPAndroidChart.
A curated list of awesome computer vision resources
Stable Diffusion with Core ML on Apple Silicon
Wan: Open and Advanced Large-Scale Video Generative Models
HunyuanVideo: A Systematic Framework For Large Video Generation Model
LAVIS - A One-stop Library for Language-Vision Intelligence
Run Stable Diffusion on Mac natively
SwinIR: Image Restoration Using Swin Transformer (official repository)
Google Drive Public File Downloader when Curl/Wget Fails
[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing
Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)
Just playing with getting VQGAN+CLIP running locally, rather than having to use Colab.
Using modified BiSeNet for face parsing in PyTorch
This is the repo for our new project Highly Accurate Dichotomous Image Segmentation
A sketch extractor for anime/illustration.
[CVPR 2022] Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer
This is a resource list for low-light image enhancement
Designing a Practical Degradation Model for Deep Blind Image Super-Resolution (ICCV, 2021) (PyTorch) - We released the training code!
[CVPR 2022 Oral] Official repository for "MAXIM: Multi-Axis MLP for Image Processing". SOTA for denoising, deblurring, deraining, dehazing, and enhancement.
[IJCV 2022] Bridging Composite and Real: Towards End-to-end Deep Image Matting
Learning Image-adaptive 3D Lookup Tables for High Performance Photo Enhancement in Real-time
A Fast Deep Learning Model to Upsample Low Resolution Videos to High Resolution at 30fps
An official implementation of MobileStyleGAN in PyTorch
An open source library that lets your users draw on things - mark up images with text, shapes, etc.