bestchen97

Jim Chen bestchen97

Love life and to be better.

5 followers · 5 following

Hangzhou, Zhejiang, China

Lists (7)

Action Video

Action recognition and Video analysis

14 repositories

Event

45 repositories

Hand Pose

HumanPose

HumanPose resource

48 repositories

Shape

Study

Learning resource

36 repositories

Tools

Object Detection and CV tools

43 repositories

Starred repositories

open-compass / VLMEvalKit

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 3,312 531 Updated Nov 5, 2025

SageCao1125 / EOLO

[ICRA 2024] Chasing Day and Night: Towards Robust and Efficient All-Day Object Detection by an Event Camera

Python 22 5 Updated Sep 26, 2025

2U1 / Qwen-VL-Series-Finetune

An open-source implementation for fine-tuning the Qwen-VL series by Alibaba Cloud.

Python 1,353 170 Updated Oct 22, 2025

facebookresearch / dinov3

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,117 547 Updated Nov 3, 2025

facebookresearch / SpinQuant

Code repo for the paper "SpinQuant: LLM quantization with learned rotations"

Python 343 56 Updated Feb 14, 2025

bytedance / Sa2VA

Official Repo For "Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos"

Python 1,379 96 Updated Nov 4, 2025

OpenBMB / MiniCPM-V

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,184 1,665 Updated Sep 24, 2025

SensorsINI / v2e

V2E: From video frames to DVS events

Python 397 67 Updated Oct 17, 2025

Lynn0306 / DVS-Voltmeter

ECCV2022 'DVS-Voltmeter: Stochastic Process-based Event Simulator for Dynamic Vision Sensors'

Python 55 6 Updated Dec 29, 2022

OpenGVLab / VideoChat-Flash

VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling

Python 477 14 Updated Jun 13, 2025

ShareGPT4Omni / ShareGPT4Video

[NeurIPS 2024] An official implementation of "ShareGPT4Video: Improving Video Understanding and Generation with Better Captions"

Python 1,078 44 Updated Oct 9, 2024

Kwai-Keye / Keye

Python 694 12 Updated Nov 1, 2025

alibaba-damo-academy / PixelRefer

The code for PixelRefer & VideoRefer

Jupyter Notebook 299 16 Updated Oct 28, 2025

VectorSpaceLab / Video-XL

🔥🔥 First-ever hour-scale video understanding models

Python 568 36 Updated Jul 14, 2025

alibaba-mmai-research / TAdaConv

[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.

Python 240 33 Updated Aug 23, 2023

yingsen1 / UniMD

UniMD: Towards Unifying Moment retrieval and temporal action Detection

Python 54 1 Updated Jul 5, 2024

amazon-science / tubelet-transformer

This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection

Python 86 20 Updated Apr 14, 2023

OpenGVLab / VideoMamba

[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding

Python 1,014 84 Updated Jul 6, 2024

OpenGVLab / InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 2,095 131 Updated Aug 7, 2025

mbzuai-oryx / Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python 1,462 122 Updated Aug 5, 2025

MCG-NJU / VideoMAE-Action-Detection

[NeurIPS 2022 Spotlight] VideoMAE for Action Detection

Python 68 8 Updated Feb 3, 2023

Blaizzy / mlx-vlm

MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

Python 1,808 200 Updated Nov 5, 2025

ziplab / LongVLM

Python 106 12 Updated Jul 30, 2024

zyayoung / Awesome-Video-LLMs

Explore VLM-Eval, a framework for evaluating Video Large Language Models, enhancing your video analysis with cutting-edge AI technology.

Python 36 1 Updated Jan 20, 2024

MCG-NJU / BasicTAD

BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection

Python 51 7 Updated Jun 10, 2023

MCG-NJU / EVAD

[ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement

Python 37 5 Updated Sep 27, 2023

MCG-NJU / ViT-TAD

[CVPR 2024] Adapting Short-Term Transformers for Action Detection in Untrimmed Videos

Python 12 1 Updated Jun 11, 2024

xlliu7 / TadTR

[TIP 2022] End-to-end Temporal Action Detection with Transformer

Python 157 13 Updated Feb 19, 2023

xlliu7 / E2E-TAD

[CVPR 2022] An Empirical Study of End-to-end Temporal Action Detection

Python 86 12 Updated Feb 19, 2023

UX-Decoder / Segment-Everything-Everywhere-All-At-Once

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,739 448 Updated Aug 19, 2024

Starred topics

video-understanding

action-recognition

action-detection

Machine learning

Computer vision

human-behavior-understanding

human-pose-estimation