Skip to content
View bestchen97's full-sized avatar
  • Hangzhou, Zhejiang, China

Block or report bestchen97

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.

Python 9,331 509 Updated Nov 7, 2025

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Python 14,713 3,418 Updated Nov 7, 2025

Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).

Python 15,213 3,602 Updated Nov 7, 2025

Ultralytics YOLO 🚀

Python 48,392 9,334 Updated Nov 7, 2025

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python 30,394 3,595 Updated Nov 7, 2025

Open Source Computer Vision Library

C++ 84,783 56,355 Updated Nov 7, 2025

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python 7,162 1,285 Updated Nov 7, 2025

Simulation of spiking neural networks (SNNs) using PyTorch.

Python 1,635 340 Updated Nov 7, 2025

Cross-platform, customizable ML solutions for live and streaming media.

C++ 31,850 5,601 Updated Nov 6, 2025

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Python 2,492 424 Updated Nov 6, 2025

An official implementation for "OneOcc: Semantic Occupancy Prediction for Legged Robots with a Single Panoramic Camera"

15 Updated Nov 6, 2025

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,695 5,060 Updated Nov 6, 2025

MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

Python 1,813 200 Updated Nov 5, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3,324 535 Updated Nov 5, 2025

Event-based Vision Resources. Community effort to collect knowledge on event-based vision technology (papers, workshops, datasets, code, videos, etc)

3,360 713 Updated Nov 5, 2025

Official Repo For "Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos"

Python 1,383 96 Updated Nov 4, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,163 551 Updated Nov 3, 2025

YOLOv3 in PyTorch > ONNX > CoreML > TFLite

Python 10,473 3,458 Updated Nov 2, 2025

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 55,949 17,303 Updated Nov 2, 2025

[CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation

Python 1,377 89 Updated Nov 2, 2025
Python 693 12 Updated Nov 1, 2025

BoxMOT: Pluggable SOTA multi-object tracking modules modules for segmentation, object detection and pose estimation models

Python 7,775 1,855 Updated Oct 31, 2025

《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version in translation

Java 118,281 14,523 Updated Oct 30, 2025

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 24,370 3,428 Updated Oct 28, 2025

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 12,298 1,207 Updated Oct 28, 2025

The code for PixelRefer & VideoRefer

Jupyter Notebook 303 16 Updated Oct 28, 2025

A deep learning library for video understanding research.

Python 3,495 427 Updated Oct 27, 2025

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,227 408 Updated Oct 27, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 16,071 1,275 Updated Oct 27, 2025

A curated list of awesome papers on dataset distillation and related applications.

HTML 1,822 165 Updated Oct 27, 2025
Next