RizwanMunawar

❄️

Building something cool in computer vision 🚀

Muhammad Rizwan Munawar RizwanMunawar

❄️

Building something cool in computer vision 🚀

1k followers · 7 following

@ultralytics
Islamabad Pakistan
19:52 (UTC +05:00)
https://visionusecases.com/
in/muhammadrizwanmunawar
@muhammdrizwanmr
@muhammadrizwanmunawar
https://muhammadrizwanmunawar.medium.com/

Achievements

x3 x4 x3

Achievements

x3 x4 x3

Highlights

Developer Program Member

Organizations

Lists (2)

Sort

Data-preprocessing

💯 Data Preprocessing CV

1 repository

Stars

Significant-Gravitas / AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 180,348 46,186 Updated Dec 16, 2025

google-gemini / gemini-cli

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 87,739 10,043 Updated Dec 17, 2025

astral-sh / uv

An extremely fast Python package and project manager, written in Rust.

Rust 75,086 2,347 Updated Dec 17, 2025

ultralytics / yolov5

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 56,385 17,367 Updated Dec 17, 2025

ultralytics / ultralytics

Ultralytics YOLO 🚀

Python 50,039 9,662 Updated Dec 17, 2025

reflex-dev / reflex

🕸️ Web apps in pure Python 🐍

Python 27,823 1,667 Updated Dec 17, 2025

HumanSignal / label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

TypeScript 25,856 3,262 Updated Dec 17, 2025

apple / foundationdb

FoundationDB - the open source, distributed, transactional key-value store

C++ 16,014 1,455 Updated Dec 15, 2025

PaddlePaddle / PaddleDetection

Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

Python 13,981 3,004 Updated Oct 10, 2025

ultralytics / yolov3

YOLOv3 in PyTorch > ONNX > CoreML > TFLite

Python 10,523 3,451 Updated Dec 17, 2025

vikhyat / moondream

tiny vision language model

Python 9,012 696 Updated Nov 14, 2025

facebookresearch / dinov3

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,830 651 Updated Nov 20, 2025

mage-ai / mage-ai

🧙 Build, run, and manage data pipelines for integrating and transforming data.

Python 8,589 894 Updated Dec 16, 2025

LiheYoung / Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 7,918 598 Updated Jul 17, 2024

DepthAnything / Depth-Anything-V2

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 7,232 724 Updated Jan 22, 2025

apple / ml-fastvlm

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 7,060 521 Updated May 5, 2025

gaomingqi / Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,896 507 Updated Dec 13, 2025

RangiLyu / nanodet

NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone🔥

Python 6,129 1,090 Updated Aug 8, 2024

facebookresearch / sam3

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 6,000 693 Updated Dec 11, 2025

deepseek-ai / DeepSeek-VL2

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 5,150 1,813 Updated Feb 26, 2025

obss / sahi

Framework agnostic sliced/tiled inference + interactive ui + error analysis plots

Python 4,987 718 Updated Dec 15, 2025

KaiyangZhou / deep-person-reid

Torchreid: Deep learning person re-identification in PyTorch.

Python 4,699 1,190 Updated Jul 22, 2024

manycore-research / SpatialLM

[NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling

Python 4,132 323 Updated Sep 26, 2025

autodistill / autodistill

Images to inference with no labeling (use foundation models to train supervised models).

Python 2,546 206 Updated May 14, 2025

THU-MIG / yoloe

YOLOE: Real-Time Seeing Anything [ICCV 2025]

Python 1,950 184 Updated Jun 26, 2025

facebookresearch / perception_models

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,850 119 Updated Dec 16, 2025

watercrawl / WaterCrawl

Transform Web Content into LLM-Ready Data

TypeScript 1,551 168 Updated Dec 11, 2025

iMoonLab / yolov13

Implementation of "YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception".

Python 1,494 152 Updated Nov 18, 2025

NVlabs / describe-anything

[ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning

Python 1,432 84 Updated Jun 26, 2025

ultralytics / JSON2YOLO

Convert JSON annotations into YOLO format.

Python 1,155 261 Updated Jul 11, 2025