Stars
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
[CVPR 2025] Official code repository for "MaSS13K: A Matting-level Semantic Segmentation Benchmark"
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
Exploring variational-autoencoder-based semantic segmentation for analyzing CT-scans.
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
[CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).
The Replica Dataset v1 as published in https://arxiv.org/abs/1906.05797 .
This repo is the official implementation of iSeg: An Iterative Refinement-based Framework for Training-free Segmentation.
A Node JS module to read music files from iRealPro.
[CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation
The Missing Point in Vision Transformers for Universal Image Segmentation
The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919
The official homepage of the COCO-Stuff dataset.
📓 A curated list of deep learning image matting papers and codes
Python implementation of Poisson matting method
[Image and Vision Computing (Vol.147 Jul. '24)] Interactive Natural Image Matting with Segment Anything Models
C++/CUDA Dense Conditional Random Field using Meanfield inference
A high-performance JavaScript library for 3D and 2D computational geometry in the browser, powered by Geogram (via WebAssembly) and visualized with Three.js.
LINEA: Fast and accurate line detection using scalable transformers [ICIP 2025]
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
Optimized, non-recursive flood fill using a scan line search
Demonstration of MobileSAM in the browser enabled through ONNX runtime web