Starred repositories
21 Lessons, Get Started Building with Generative AI
12 Lessons to Get Started Building AI Agents
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters, extended Kalman filters, unscented Kalman filters, particle filte…
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Qwen3-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
High-Resolution Image Synthesis with Latent Diffusion Models
PyTorch code and models for the DINOv2 self-supervised learning method.
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
Code release for NeRF (Neural Radiance Fields)
Official inference library for Mistral models
Reference PyTorch implementation and models for DINOv3
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like RF-DETR, YOLO11, SAM …
NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
CoTracker is a model for tracking any point (pixel) on a video.
Segment Anything in High Quality [NeurIPS 2023]
Open-source and strong foundation image recognition models.
[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
GTSAM is a library of C++ classes that implement smoothing and mapping (SAM) in robotics and vision, using factor graphs and Bayes networks as the underlying computing paradigm rather than sparse m…
LLM Finetuning with peft
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
Efficient neural feature detector and descriptor
Handout for the tutorial "Creating publication-quality figures with matplotlib"
Let us democratise high-resolution generation! (CVPR 2024)
Recipes for shrinking, optimizing, and customizing cutting-edge vision models. 💜
Benchmarking Knowledge Transfer in Lifelong Robot Learning