  • Hangzhou, Zhejiang, China

Starred repositories

231 stars written in Python

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 55,939 17,301 Updated Nov 2, 2025

Ultralytics YOLO 🚀

Python 48,336 9,325 Updated Nov 6, 2025

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,676 5,062 Updated Nov 6, 2025

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 35,377 3,886 Updated Apr 19, 2025

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python 30,384 3,594 Updated Nov 6, 2025

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch

Python 24,357 3,426 Updated Oct 28, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 23,905 2,659 Updated Aug 12, 2024

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,189 1,665 Updated Sep 24, 2025

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 15,385 2,191 Updated Jul 24, 2024

Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).

Python 15,211 3,601 Updated Nov 6, 2025

End-to-End Object Detection with Transformers

Python 14,835 2,614 Updated Mar 12, 2024

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Python 14,709 3,415 Updated Nov 6, 2025

Image augmentation for machine learning experiments.

Python 14,695 2,472 Updated Jul 30, 2024

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 12,281 1,202 Updated Oct 28, 2025

Generate 3D objects conditioned on text or images

Python 12,125 1,048 Updated Jun 22, 2024

YOLOv3 in PyTorch > ONNX > CoreML > TFLite

Python 10,472 3,458 Updated Nov 2, 2025

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

Python 10,152 2,402 Updated Jun 8, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance

Python 9,431 734 Updated Sep 22, 2025

Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.

Python 9,330 509 Updated Nov 6, 2025

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 9,204 945 Updated Aug 12, 2024

Real-Time and Accurate Full-Body Multi-Person Pose Estimation & Tracking System

Python 8,443 2,021 Updated May 13, 2024

More relighting!

Python 8,276 520 Updated Feb 20, 2025

PyTorch implementation of convolutional neural network visualization techniques

Python 8,151 1,508 Updated Jan 1, 2025

PyTorch implementation of MAE https://arxiv.org/abs/2111.06377

Python 8,073 1,328 Updated Jul 23, 2024

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 7,824 595 Updated Jul 17, 2024

BoxMOT: Pluggable SOTA multi-object tracking modules for segmentation, object detection and pose estimation models

Python 7,770 1,855 Updated Oct 31, 2025

Your image is almost there!

Python 7,650 442 Updated Jul 26, 2024

Minimal PyTorch implementation of YOLOv3

Python 7,436 2,620 Updated Nov 17, 2024

A Collection of Variational Autoencoders (VAE) in PyTorch.

Python 7,417 1,176 Updated Mar 21, 2025

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Python 7,281 1,004 Updated Jul 3, 2024