  • Guangdong Shenzhen

435 stars written in Python

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 6,848 478 Updated May 5, 2025

Example models using DeepSpeed

Python 6,709 1,109 Updated Oct 15, 2025

s1: Simple test-time scaling

Python 6,593 763 Updated Jun 25, 2025

Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

Python 6,581 586 Updated Oct 24, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,345 469 Updated Aug 7, 2024

OpenMMLab's next-generation platform for general 3D object detection.

Python 6,102 1,690 Updated Jul 10, 2024

NanoDet-Plus ⚡ Super fast and lightweight anchor-free object detection model. 🔥 Only 980 KB (int8) / 1.8 MB (fp16), and runs at 97 FPS on a cellphone 🔥

Python 6,092 1,081 Updated Aug 8, 2024

Modeling, training, eval, and inference code for OLMo

Python 6,087 669 Updated Oct 24, 2025

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,981 567 Updated Feb 26, 2025

Simple Online Realtime Tracking with a Deep Association Metric

Python 5,956 1,561 Updated Mar 2, 2025

[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box

Python 5,771 1,054 Updated Jun 19, 2024

Solve Visual Understanding with Reinforced VLMs

Python 5,674 366 Updated Oct 21, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,479 286 Updated Nov 7, 2025

Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models

Python 5,448 509 Updated Oct 25, 2025

OpenPCDet Toolbox for LiDAR-based 3D Object Detection.

Python 5,319 1,397 Updated Oct 8, 2025

A PyTorch Implementation of Single Shot MultiBox Detector

Python 5,225 1,744 Updated Dec 29, 2021

🚀 "Large model": train a 26M-parameter visual multimodal VLM from scratch in just 1 hour! 🌏

Python 5,224 551 Updated Oct 30, 2025

Code for robust monocular depth estimation described in "Ranftl et al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"

Python 5,192 707 Updated Aug 23, 2024

Vision agent

Python 5,094 576 Updated Aug 30, 2025

Count the MACs / FLOPs of your PyTorch model.

Python 5,054 531 Updated Jul 8, 2024

OpenMMLab Text Detection, Recognition and Understanding Toolbox

Python 4,669 776 Updated Nov 27, 2024

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4,667 319 Updated Aug 19, 2025

Torchreid: Deep learning person re-identification in PyTorch.

Python 4,657 1,186 Updated Jul 22, 2024
Python 4,373 417 Updated Sep 14, 2025

⛹️ Pytorch ReID: A tiny, friendly, strong PyTorch implementation of a person re-ID / vehicle re-ID baseline. Tutorial 👉 https://github.com/layumi/Person_reID_baseline_pytorch/tree/master/tutorial

Python 4,356 1,022 Updated Oct 24, 2025

[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving

Python 4,298 482 Updated Oct 29, 2025

Simple, online, and realtime tracking of multiple objects in a video sequence.

Python 4,281 1,137 Updated Nov 28, 2023

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large models, implementing incremental pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO.

Python 4,278 629 Updated Aug 30, 2025

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,224 408 Updated Oct 27, 2025