bestchen97 • Hangzhou, Zhejiang, China

Starred repositories

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 3,312 531 Updated Nov 5, 2025

[ICRA 2024] Chasing Day and Night: Towards Robust and Efficient All-Day Object Detection by an Event Camera

Python 22 5 Updated Sep 26, 2025

An open-source implementation for fine-tuning the Qwen-VL series by Alibaba Cloud.

Python 1,353 170 Updated Oct 22, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,117 547 Updated Nov 3, 2025

Code repo for the paper "SpinQuant: LLM quantization with learned rotations"

Python 343 56 Updated Feb 14, 2025

Official Repo For "Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos"

Python 1,379 96 Updated Nov 4, 2025

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,184 1,665 Updated Sep 24, 2025

V2E: From video frames to DVS events

Python 397 67 Updated Oct 17, 2025

ECCV2022 'DVS-Voltmeter: Stochastic Process-based Event Simulator for Dynamic Vision Sensors'

Python 55 6 Updated Dec 29, 2022

VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling

Python 477 14 Updated Jun 13, 2025

[NeurIPS 2024] An official implementation of "ShareGPT4Video: Improving Video Understanding and Generation with Better Captions"

Python 1,078 44 Updated Oct 9, 2024

Python 694 12 Updated Nov 1, 2025

The code for PixelRefer & VideoRefer

Jupyter Notebook 299 16 Updated Oct 28, 2025

🔥🔥 First-ever hour-scale video understanding models

Python 568 36 Updated Jul 14, 2025

[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.

Python 240 33 Updated Aug 23, 2023

UniMD: Towards Unifying Moment retrieval and temporal action Detection

Python 54 1 Updated Jul 5, 2024

This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection

Python 86 20 Updated Apr 14, 2023

[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding

Python 1,014 84 Updated Jul 6, 2024

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 2,095 131 Updated Aug 7, 2025

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python 1,462 122 Updated Aug 5, 2025

[NeurIPS 2022 Spotlight] VideoMAE for Action Detection

Python 68 8 Updated Feb 3, 2023

MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

Python 1,808 200 Updated Nov 5, 2025

Python 106 12 Updated Jul 30, 2024

Explore VLM-Eval, a framework for evaluating Video Large Language Models, enhancing your video analysis with cutting-edge AI technology.

Python 36 1 Updated Jan 20, 2024

BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection

Python 51 7 Updated Jun 10, 2023

[ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement

Python 37 5 Updated Sep 27, 2023

[CVPR 2024] Adapting Short-Term Transformers for Action Detection in Untrimmed Videos

Python 12 1 Updated Jun 11, 2024

[TIP 2022] End-to-end Temporal Action Detection with Transformer

Python 157 13 Updated Feb 19, 2023

[CVPR 2022] An Empirical Study of End-to-end Temporal Action Detection

Python 86 12 Updated Feb 19, 2023

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,739 448 Updated Aug 19, 2024