bestchen97

Follow

Jim Chen bestchen97

Follow

Love life and to be better.

5 followers · 5 following

Hangzhou, Zhejiang, China

Lists (7)

Sort

Action Video

Action recognition and Video analysis

14 repositories

Event

45 repositories

Hand Pose

HumanPose

HumanPose resource

48 repositories

Shape

Study

Learning resource

36 repositories

Tools

Object Detection and CV tools

43 repositories

Starred repositories

231 stars written in Python

facebookresearch / SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python 7,163 1,285 Updated Nov 7, 2025

open-mmlab / mmpose

OpenMMLab Pose Estimation Toolbox and Benchmark.

Python 7,028 1,408 Updated Aug 4, 2025

DepthAnything / Depth-Anything-V2

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 6,965 691 Updated Jan 22, 2025

gaomingqi / Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,872 504 Updated May 31, 2024

openai / point-e

Point cloud diffusion for 3D model synthesis

Python 6,817 796 Updated Jul 4, 2024

princeton-vl / infinigen

Infinite Photorealistic Worlds using Procedural Generation

Python 6,694 541 Updated Oct 18, 2025

QwenLM / Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,345 469 Updated Aug 7, 2024

SmirkCao / Lihang

Statistical learning methods, 统计学习方法(第2版)[李航] [笔记, 代码, notebook, 参考文献, Errata, lihang]

Python 6,240 1,605 Updated Aug 5, 2023

RangiLyu / nanodet

NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone🔥

Python 6,093 1,081 Updated Aug 8, 2024

AILab-CVC / YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,981 567 Updated Feb 26, 2025

canopyai / Orpheus-TTS

Towards Human-Sounding Speech

Python 5,700 484 Updated May 6, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 5,675 366 Updated Oct 21, 2025

facebookresearch / sapiens

High-resolution models for human tasks.

Python 5,199 304 Updated Nov 18, 2024

deepseek-ai / DeepSeek-VL2

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 5,116 1,790 Updated Feb 26, 2025

open-mmlab / mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,801 1,317 Updated Aug 14, 2024

UX-Decoder / Segment-Everything-Everywhere-All-At-Once

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,739 449 Updated Aug 19, 2024

STVIR / pysot

SenseTime Research platform for single object tracking, implementing algorithms like SiamRPN and SiamMask.

Python 4,558 1,111 Updated Jun 22, 2025

yanx27 / Pointnet_Pointnet2_pytorch

PointNet and PointNet++ implemented by pytorch (pure python) and on ModelNet, ShapeNet and S3DIS.

Python 4,536 989 Updated Apr 24, 2024

zju3dv / EasyMocap

Make human motion capture easier.

Python 4,320 523 Updated Feb 26, 2025

facebookresearch / deit

Official DeiT repository

Python 4,278 584 Updated Mar 15, 2024

huggingface / nanoVLM

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,224 408 Updated Oct 27, 2025

fundamentalvision / BEVFormer

[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

Python 4,114 657 Updated Aug 15, 2024

facebookresearch / VideoPose3D

Efficient 3D human pose estimation in video using 2D keypoint trajectories

Python 3,929 777 Updated Dec 10, 2022

open-mmlab / mmtracking

OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.

Python 3,812 614 Updated Sep 19, 2023

princeton-vl / RAFT

Python 3,803 655 Updated Aug 24, 2025

open-mmlab / mmpretrain

OpenMMLab Pre-training Toolbox and Benchmark

Python 3,761 1,106 Updated Nov 1, 2024

JDAI-CV / fast-reid

SOTA Re-identification Methods and Toolbox

Python 3,753 863 Updated Jul 30, 2024

charlesq34 / pointnet2

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

Python 3,535 929 Updated Aug 26, 2022

facebookresearch / pytorchvideo

A deep learning library for video understanding research.

Python 3,495 427 Updated Oct 27, 2025

visionml / pytracking

Visual tracking library based on PyTorch.

Python 3,445 612 Updated Aug 8, 2024

Starred topics

video-understanding

action-recognition

action-detection

Machine learning

Computer vision

human-behavior-understanding

human-pose-estimation