Jim Chen bestchen97

Love life and to be better.

5 followers · 5 following

Hangzhou, Zhejiang, China

Lists (7)

Starred repositories

Acly / krita-ai-diffusion

Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.

Python 9,331 509 Updated Nov 7, 2025

cvat-ai / cvat

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Python 14,713 3,418 Updated Nov 7, 2025

wkentaro / labelme

Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).

Python 15,213 3,602 Updated Nov 7, 2025

ultralytics / ultralytics

Ultralytics YOLO 🚀

Python 48,392 9,334 Updated Nov 7, 2025

Lightning-AI / pytorch-lightning

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python 30,394 3,595 Updated Nov 7, 2025

opencv / opencv

Open Source Computer Vision Library

C++ 84,783 56,355 Updated Nov 7, 2025

facebookresearch / SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python 7,162 1,285 Updated Nov 7, 2025

BindsNET / bindsnet

Simulation of spiking neural networks (SNNs) using PyTorch.

Python 1,635 340 Updated Nov 7, 2025

google-ai-edge / mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

C++ 31,850 5,601 Updated Nov 6, 2025

quic / aimet

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Python 2,492 424 Updated Nov 6, 2025

MasterHow / OneOcc

An official implementation for "OneOcc: Semantic Occupancy Prediction for Legged Robots with a Single Panoramic Camera"

15 Updated Nov 6, 2025

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,695 5,060 Updated Nov 6, 2025

Blaizzy / mlx-vlm

MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

Python 1,813 200 Updated Nov 5, 2025

open-compass / VLMEvalKit

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks.

Python 3,324 535 Updated Nov 5, 2025

uzh-rpg / event-based_vision_resources

Event-based Vision Resources. Community effort to collect knowledge on event-based vision technology (papers, workshops, datasets, code, videos, etc)

3,360 713 Updated Nov 5, 2025

bytedance / Sa2VA

Official Repo For "Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos"

Python 1,383 96 Updated Nov 4, 2025

facebookresearch / dinov3

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,163 551 Updated Nov 3, 2025

ultralytics / yolov3

YOLOv3 in PyTorch > ONNX > CoreML > TFLite

Python 10,473 3,458 Updated Nov 2, 2025

ultralytics / yolov5

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 55,949 17,303 Updated Nov 2, 2025

pq-yang / MatAnyone

[CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation

Python 1,377 89 Updated Nov 2, 2025

Kwai-Keye / Keye

Python 693 12 Updated Nov 1, 2025

mikel-brostrom / boxmot

BoxMOT: Pluggable SOTA multi-object tracking modules for segmentation, object detection and pose estimation models

Python 7,775 1,855 Updated Oct 31, 2025

krahets / hello-algo

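"Hello Algo": a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. The Simplified and Traditional Chinese editions are updated in sync; the English version is in translation.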

Java 118,281 14,523 Updated Oct 30, 2025

lucidrains / vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch

Python 24,370 3,428 Updated Oct 28, 2025

Tencent-Hunyuan / Hunyuan3D-2

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 12,298 1,207 Updated Oct 28, 2025

alibaba-damo-academy / PixelRefer

The code for PixelRefer & VideoRefer

Jupyter Notebook 303 16 Updated Oct 28, 2025

facebookresearch / pytorchvideo

A deep learning library for video understanding research.

Python 3,495 427 Updated Oct 27, 2025

huggingface / nanoVLM

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,227 408 Updated Oct 27, 2025

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 16,071 1,275 Updated Oct 27, 2025

Guang000 / Awesome-Dataset-Distillation

A curated list of awesome papers on dataset distillation and related applications.

Lists (7)

Action Video

Event

Hand Pose

HumanPose

Shape

Study

Tools

Starred repositories

video-understanding

action-recognition

action-detection

Machine learning

Computer vision

human-behavior-understanding

human-pose-estimation