  • Hangzhou, Zhejiang, China

Starred repositories

MATLAB code for paper "Event-Based Motion Segmentation by Motion Compensation"

MATLAB 35 4 Updated Jul 29, 2020

An official implementation for "OneOcc: Semantic Occupancy Prediction for Legged Robots with a Single Panoramic Camera"

15 Updated Nov 6, 2025

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 3,323 534 Updated Nov 5, 2025

[ICRA 2024] Chasing Day and Night: Towards Robust and Efficient All-Day Object Detection by an Event Camera

Python 22 5 Updated Sep 26, 2025

An open-source implementation for fine-tuning the Qwen-VL series by Alibaba Cloud.

Python 1,361 169 Updated Oct 22, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,158 551 Updated Nov 3, 2025

Code repo for the paper "SpinQuant LLM quantization with learned rotations"

Python 344 56 Updated Feb 14, 2025

Official Repo For "Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos"

Python 1,381 96 Updated Nov 4, 2025

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,193 1,664 Updated Sep 24, 2025

V2E: From video frames to DVS events

Python 397 67 Updated Oct 17, 2025

ECCV2022 'DVS-Voltmeter: Stochastic Process-based Event Simulator for Dynamic Vision Sensors'

Python 55 6 Updated Dec 29, 2022

VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling

Python 477 14 Updated Jun 13, 2025

[NeurIPS 2024] An official implementation of "ShareGPT4Video: Improving Video Understanding and Generation with Better Captions"

Python 1,078 44 Updated Oct 9, 2024
Python 693 12 Updated Nov 1, 2025

The code for PixelRefer & VideoRefer

Jupyter Notebook 303 16 Updated Oct 28, 2025

🔥🔥 First-ever hour-scale video understanding models

Python 568 36 Updated Jul 14, 2025

[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.

Python 241 33 Updated Aug 23, 2023

UniMD: Towards Unifying Moment retrieval and temporal action Detection

Python 54 1 Updated Jul 5, 2024

This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection

Python 87 20 Updated Apr 14, 2023

[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding

Python 1,019 84 Updated Jul 6, 2024

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 2,097 131 Updated Aug 7, 2025

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python 1,463 123 Updated Aug 5, 2025

[NeurIPS 2022 Spotlight] VideoMAE for Action Detection

Python 68 8 Updated Feb 3, 2023

MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

Python 1,813 200 Updated Nov 5, 2025
Python 106 12 Updated Jul 30, 2024

Explore VLM-Eval, a framework for evaluating Video Large Language Models, enhancing your video analysis with cutting-edge AI technology.

Python 36 1 Updated Jan 20, 2024

BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection

Python 51 7 Updated Jun 10, 2023

[ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement

Python 37 5 Updated Sep 27, 2023

[CVPR 2024] Adapting Short-Term Transformers for Action Detection in Untrimmed Videos

Python 12 1 Updated Jun 11, 2024