Stars
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
OpenMMLab Detection Toolbox and Benchmark
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Datasets, Transforms and Models specific to Computer Vision
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
A paper list of object detection using deep learning.
PyTorch package for the discrete VAE used for DALL·E.
Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.
Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
A faster pytorch implementation of faster r-cnn
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722
Most popular metrics used to evaluate object detection algorithms.
A data augmentations library for audio, image, text, and video.
3D ResNets for Action Recognition (CVPR 2018)
A highly efficient implementation of Gaussian Processes in PyTorch
PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)
OpenMMLab Self-Supervised Learning Toolbox and Benchmark
PyTorch code and models for V-JEPA self-supervised learning from video.
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…
A Simple and Versatile Framework for Object Detection and Instance Recognition