Stars
The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while control…
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
This is Pytorch re-implementation of our CVPR 2020 paper "Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation" (https://arxiv.org/abs/1911.10194)
Datasets, Transforms and Models specific to Computer Vision
Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image …
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Implementation of YOSO [CVPR 2023] by MMDetection3.x
Revisiting K-Net for Real-Time Panoptic Segmentation. Code release for our IV 2023 paper.
[CVPR2023] FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation
ISTR: End-to-End Instance Segmentation with Transformers (https://arxiv.org/abs/2105.00637)
Code release for paper "You Only Segment Once: Towards Real-Time Panoptic Segmentation" [CVPR 2023]
Fully Convolutional Networks for Panoptic Segmentation (CVPR2021 Oral)
Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)
[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"
[ICLR 2022] Official implementation of the paper "DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR"
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
[NeurIPS 2021] You Only Look at One Sequence
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
[CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation
[CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).
This repository contains demos I made with the Transformers library by HuggingFace.
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
[ICCV2021] Official PyTorch implementation of Segmenter: Transformer for Semantic Segmentation
The project is an official implementation of our CVPR2019 paper "Deep High-Resolution Representation Learning for Human Pose Estimation"
The project is an official implement of our ECCV2018 paper "Simple Baselines for Human Pose Estimation and Tracking(https://arxiv.org/abs/1804.06208)"
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box