Lists (2)
Sort Name ascending (A-Z)
Starred repositories
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
DSPy: The framework for programming—not prompting—language models
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Development repository for the Triton language and compiler
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
PyTorch code and models for the DINOv2 self-supervised learning method.
Techniques for deep learning with satellite & aerial imagery
Sharp Monocular View Synthesis in Less Than a Second
Infinite Photorealistic Worlds using Procedural Generation
Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"
A curated list of papers & resources linked to 3D reconstruction from images.
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)
PyTorch code and models for V-JEPA self-supervised learning from video.
Official implementation of Character Region Awareness for Text Detection (CRAFT)
pySLAM is a hybrid Python/C++ Visual SLAM pipeline supporting monocular, stereo, and RGB-D cameras. It provides a broad set of modern local and global feature extractors, multiple loop-closure stra…
An open source platform for visual-inertial navigation research.
Pangolin is a lightweight portable rapid development library for managing OpenGL display / interaction and abstracting video input.
[CVPR 2025 Best Paper Nomination] FoundationStereo: Zero-Shot Stereo Matching
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM (CVPR 2024)
InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
An easy-to-use Python library for processing and manipulating 3D point clouds and meshes.
This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025