KimSoybean
  • JD AI Research
  • Shenzhen, China

Datasets, Transforms and Models specific to Computer Vision

Python 17,288 7,171 Updated Nov 8, 2025

A highly efficient implementation of Gaussian Processes in PyTorch

Python 3,786 575 Updated Nov 8, 2025

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python 7,162 1,285 Updated Nov 7, 2025

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,957 131 Updated Nov 7, 2025

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXt, EfficientNet, NFNet, Vision Transformer (V…

Python 35,711 5,061 Updated Nov 6, 2025

A data augmentations library for audio, image, text, and video.

Python 5,057 309 Updated Oct 31, 2025

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch

Python 24,375 3,430 Updated Oct 28, 2025

solo-learn: a library of self-supervised methods for visual representation learning powered by PyTorch Lightning

Python 1,526 196 Updated Oct 20, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,498 41 Updated Oct 15, 2025

This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation"

Python 238 9 Updated Oct 12, 2025

PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722

Python 5,090 804 Updated Sep 30, 2025

Acceptance rates for the major AI conferences

Jupyter Notebook 4,658 312 Updated Sep 23, 2025

A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.

Python 23,534 9,768 Updated Sep 1, 2025

The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.

Python 6,249 665 Updated Aug 17, 2025

This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.

Python 1,383 66 Updated Aug 4, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,818 2,667 Updated Jul 3, 2025

Single-stage general-purpose object detector

Python 1,976 501 Updated Jul 2, 2025

Most popular metrics used to evaluate object detection algorithms.

Python 5,084 1,036 Updated Jun 29, 2025

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Python 15,189 1,704 Updated Jun 25, 2025

[MICCAI 2019 Young Scientist Award] [MEDIA 2020 Best Paper Award] Models Genesis

Jupyter Notebook 769 144 Updated Jun 22, 2025

VMZ: Model Zoo for Video Modeling

Python 1,050 159 Updated Jun 17, 2025

Official Implementation of Diffusion Step Annealing (DiSA) in Autoregressive Image Generation

Jupyter Notebook 141 1 Updated May 27, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,471 542 Updated May 18, 2025

Python 628 49 Updated Apr 12, 2025

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 1,068 58 Updated Apr 1, 2025

[CVPR 2022] Pre-Training 3D Point Cloud Transformers with Masked Point Modeling

Python 652 72 Updated Mar 22, 2025

Codebase for evaluation of deep generative models as presented in Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models

Jupyter Notebook 195 18 Updated Mar 3, 2025

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,259 322 Updated Feb 27, 2025

Deep Learning Visualization Toolkit (PaddlePaddle deep learning visualization tool)

HTML 4,859 630 Updated Jan 22, 2025

Official Implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency"

Python 246 11 Updated Jan 17, 2025