Stars
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsens…
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
A PyTorch Implementation of FaceBoxes
face recognition algorithms in pytorch framework, including arcface, cosface, sphereface and so on
Unofficial PyTorch implementation of "FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence"
A Tensorflow implementation of RetinexNet
Bounding Box Regression with Uncertainty for Accurate Object Detection (CVPR'19)
pytorch version of SSD and it's enhanced methods such as RFBSSD,FSSD and RefineDet
The official PyTorch implementation of paper BBN: Bilateral-Branch Network with Cumulative Learning for Long-Tailed Visual Recognition
[CVPR 2022] Pre-Training 3D Point Cloud Transformers with Masked Point Modeling
The implementation of “Gradient Harmonized Single-stage Detector” published on AAAI 2019.
Implementation of DropBlock: A regularization method for convolutional networks in PyTorch.
Repository for Single Shot MultiBox Detector and its variants, implemented with pytorch, python3.
This repo is implemented based on detectron2 and centernet
A Pytorch-Lightning implementation of self-supervised algorithms
Implementation code of the paper: FishNet: A Versatile Backbone for Image, Region, and Pixel Level Prediction, NeurIPS 2018
A repository of common methods, datasets, and tasks for video research
A challenge to explore adversarial robustness of neural networks on CIFAR10.
Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.
Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)
Code for the CVPR 2018 Oral Paper "Deep Layer Aggregation"
Code for "Human Pose Regression with Residual Log-likelihood Estimation", ICCV 2021 Oral
Official PyTorch implementation of "A Comprehensive Overhaul of Feature Distillation" (ICCV 2019)
Code for Fast Training of Diffusion Models with Masked Transformers
Effective Video Augmentation Techniques for Training Convolutional Neural Networks
Public repo for Augmented Multiscale Deep InfoMax representation learning
ethanhe42 / softer-NMS
Forked from facebookresearch/DetectronBounding Box Regression with Uncertainty for Accurate Object Detection (CVPR'19)