Stars
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
OpenMMLab Detection Toolbox and Benchmark
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Datasets, Transforms and Models specific to Computer Vision
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
A paper list of object detection using deep learning.
PyTorch package for the discrete VAE used for DALL·E.
Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.
A faster pytorch implementation of faster r-cnn
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722
Most popular metrics used to evaluate object detection algorithms.
A data augmentations library for audio, image, text, and video.
3D ResNets for Action Recognition (CVPR 2018)
A highly efficient implementation of Gaussian Processes in PyTorch
PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)
OpenMMLab Self-Supervised Learning Toolbox and Benchmark
PyTorch code and models for V-JEPA self-supervised learning from video.
A Simple and Versatile Framework for Object Detection and Instance Recognition
SNIPER / AutoFocus is an efficient multi-scale object detection training / inference algorithm
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
PyTorch implementation of Contrastive Learning methods