Skip to content
View KimSoybean's full-sized avatar
  • JD AI Research
  • Shenzhen, China

Block or report KimSoybean

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
141 stars written in Python
Clear filter

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,683 5,062 Updated Nov 6, 2025

OpenMMLab Detection Toolbox and Benchmark

Python 31,947 9,781 Updated Aug 21, 2024

LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …

Python 24,439 6,554 Updated Jun 7, 2024

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 24,360 3,427 Updated Oct 28, 2025

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Python 23,529 9,768 Updated Sep 1, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,815 2,665 Updated Jul 3, 2025

Datasets, Transforms and Models specific to Computer Vision

Python 17,284 7,170 Updated Nov 6, 2025

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 15,384 2,191 Updated Jul 24, 2024

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Python 15,190 1,704 Updated Jun 25, 2025

A paper list of object detection using deep learning.

Python 11,430 2,772 Updated Feb 12, 2024

PyTorch package for the discrete VAE used for DALL·E.

Python 10,875 1,905 Updated Jan 31, 2024

Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.

Python 9,379 2,480 Updated Feb 16, 2023

Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.

Python 9,112 1,828 Updated Apr 22, 2022

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Python 8,074 1,328 Updated Jul 23, 2024

A faster pytorch implementation of faster r-cnn

Python 7,842 2,322 Updated May 20, 2022

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python 7,163 1,285 Updated Oct 27, 2025

Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.

Python 6,941 2,879 Updated Jan 15, 2025

95.47% on CIFAR10 with PyTorch

Python 6,294 2,174 Updated Feb 24, 2023

The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.

Python 6,248 665 Updated Aug 17, 2025

PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722

Python 5,090 804 Updated Sep 30, 2025

Most popular metrics used to evaluate object detection algorithms.

Python 5,085 1,036 Updated Jun 29, 2025

A data augmentations library for audio, image, text, and video.

Python 5,056 310 Updated Oct 31, 2025

Official DeiT repository

Python 4,278 584 Updated Mar 15, 2024

3D ResNets for Action Recognition (CVPR 2018)

Python 4,022 935 Updated Jan 20, 2021

A highly efficient implementation of Gaussian Processes in PyTorch

Python 3,786 575 Updated Oct 14, 2025

PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)

Python 3,374 556 Updated Dec 26, 2023

OpenMMLab Self-Supervised Learning Toolbox and Benchmark

Python 3,287 442 Updated Jun 25, 2023

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,257 322 Updated Feb 27, 2025

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…

Python 3,102 425 Updated May 8, 2024

A Simple and Versatile Framework for Object Detection and Instance Recognition

Python 3,088 485 Updated Sep 23, 2021
Next