Skip to content
View RizwanMunawar's full-sized avatar
❄️
Building something cool in computer vision 🚀
❄️
Building something cool in computer vision 🚀

Organizations

@ultralytics

Block or report RizwanMunawar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 180,348 46,186 Updated Dec 16, 2025

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 87,739 10,043 Updated Dec 17, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 75,086 2,347 Updated Dec 17, 2025

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 56,385 17,367 Updated Dec 17, 2025

Ultralytics YOLO 🚀

Python 50,039 9,662 Updated Dec 17, 2025

🕸️ Web apps in pure Python 🐍

Python 27,823 1,667 Updated Dec 17, 2025

Label Studio is a multi-type data labeling and annotation tool with standardized output format

TypeScript 25,856 3,262 Updated Dec 17, 2025

FoundationDB - the open source, distributed, transactional key-value store

C++ 16,014 1,455 Updated Dec 15, 2025

Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

Python 13,981 3,004 Updated Oct 10, 2025

YOLOv3 in PyTorch > ONNX > CoreML > TFLite

Python 10,523 3,451 Updated Dec 17, 2025

tiny vision language model

Python 9,012 696 Updated Nov 14, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,830 651 Updated Nov 20, 2025

🧙 Build, run, and manage data pipelines for integrating and transforming data.

Python 8,589 894 Updated Dec 16, 2025

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 7,918 598 Updated Jul 17, 2024

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 7,232 724 Updated Jan 22, 2025

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 7,060 521 Updated May 5, 2025

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,896 507 Updated Dec 13, 2025

NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone🔥

Python 6,129 1,090 Updated Aug 8, 2024

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 6,000 693 Updated Dec 11, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 5,150 1,813 Updated Feb 26, 2025

Framework agnostic sliced/tiled inference + interactive ui + error analysis plots

Python 4,987 718 Updated Dec 15, 2025

Torchreid: Deep learning person re-identification in PyTorch.

Python 4,699 1,190 Updated Jul 22, 2024

[NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling

Python 4,132 323 Updated Sep 26, 2025

Images to inference with no labeling (use foundation models to train supervised models).

Python 2,546 206 Updated May 14, 2025

YOLOE: Real-Time Seeing Anything [ICCV 2025]

Python 1,950 184 Updated Jun 26, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,850 119 Updated Dec 16, 2025

Transform Web Content into LLM-Ready Data

TypeScript 1,551 168 Updated Dec 11, 2025

Implementation of "YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception".

Python 1,494 152 Updated Nov 18, 2025

[ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning

Python 1,432 84 Updated Jun 26, 2025

Convert JSON annotations into YOLO format.

Python 1,155 261 Updated Jul 11, 2025
Next