Python 4 1 Updated Aug 20, 2025

Splices the vision head of SmolVLM2 onto the Qwen3-0.6B model and fine-tunes the combined model.

Python 479 48 Updated Sep 8, 2025

[ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer

Python 1,867 140 Updated Jul 3, 2025
1 Updated Jun 12, 2025

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,445 434 Updated Oct 27, 2025

Code from my thesis Articulated 3D Hand from a Single RGB Image, later published as Monocular 3D Hand Pose Estimation with Implicit Camera Alignment. Also contains notes from an earlier study on 2D…

Python 10 1 Updated Jun 16, 2025

[NeurIPS 2025] YOLOv12: Attention-Centric Real-Time Object Detectors

Python 2,710 389 Updated Oct 3, 2025

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,971 1,170 Updated Dec 19, 2025

[CVPR 2024 Highlight] Logit Standardization in Knowledge Distillation

Jupyter Notebook 386 29 Updated Oct 9, 2024

Strong and Open Vision Language Assistant for Mobile Devices

Python 1,316 85 Updated Apr 15, 2024

Generate high-definition short videos with one click using large AI models (LLMs).

Python 48,504 6,846 Updated Dec 14, 2025

PyTorch reimplementation of hand-biomechanical-constraints (ECCV2020)

Python 80 10 Updated Mar 29, 2021

[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.

Python 3,221 369 Updated Sep 7, 2025

YOLOv6: a single-stage object detection framework dedicated to industrial applications.

Jupyter Notebook 1 Updated Apr 29, 2023

Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).

Python 2,297 143 Updated Jun 7, 2023

HAnd Gesture Recognition Image Dataset

Python 919 128 Updated Feb 27, 2025

repo for NIMBLE: A Non-rigid Hand Model with Bones and Muscles

Python 137 20 Updated May 23, 2024

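A deep-learning-based tumor diagnosis assistance system built around image segmentation. It uses AI to identify and delineate tumor regions and provides features of those regions to assist doctors in diagnosis. Includes complete model construction, backend setup, industrial-grade deployment, and frontend access. TensorRT, PyTorch, OpenCV, Flask, Vue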

Python 642 114 Updated Jan 31, 2025