Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 17,635 1,594 Updated Sep 5, 2024

WSNLP / al_toolbox

Active learning

Python 78 10 Updated Feb 8, 2023

cvat-ai / cvat

Computer Vision Annotation Tool (CVAT) is a leading platform for building high-quality visual datasets for vision AI. It offers open-source, cloud, and enterprise products, as well as labeling serv…

Python 16,120 3,716 Updated Jun 22, 2026

amazon-science / object-centric-learning-framework

Python 95 9 Updated Aug 13, 2025

microsoft / JARVIS

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 24,875 2,152 Updated Jul 29, 2025

facebookresearch / segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 54,372 6,363 Updated Sep 18, 2024

mlfoundations / open_flamingo

An open-source framework for training large multimodal models.

Python 4,107 321 Updated Aug 31, 2024

amazon-science / self-supervised-amodal-video-object-segmentation

Python 19 7 Updated Feb 21, 2024

huggingface / simulate

🎢 Creating and sharing simulation environments for embodied and synthetic data research

Python 194 14 Updated May 26, 2026

nateraw / stable-diffusion-videos

Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts

Python 4,693 445 Updated Dec 16, 2025

visionml / pytracking

Visual tracking library based on PyTorch.

Python 3,504 613 Updated Aug 8, 2024

dragonlong / Trending-in-3D-Vision

An on-going paper list on new trends in 3D vision with deep learning

332 31 Updated Jun 17, 2022

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 33,831 4,020 Updated Mar 25, 2026

dmlc / GNNLens2

Visualization tool for Graph Neural Networks

TypeScript 261 29 Updated Sep 20, 2022

facebookresearch / Replica-Dataset

The Replica Dataset v1 as published in https://arxiv.org/abs/1906.05797 .

C++ 1,288 111 Updated Jul 22, 2024

google-research / kubric

A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.

Jupyter Notebook 2,758 275 Updated May 21, 2026

2019ChenGong / Machine-Learning-Notes

531 106 Updated May 16, 2021

detectRecog / PointTrack

PointTrack (ECCV2020 ORAL): Segment as Points for Efficient Online Multi-Object Tracking and Segmentation

Python 265 47 Updated Oct 3, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tianjun Xiao sneakerkg

Achievements

Achievements

Organizations

Block or report sneakerkg

Stars

karpathy / LLM101n

showlab / Awesome-MLLM-Hallucination

tinygrad / tinygrad

patrick-llgc / Learning-Deep-Learning

OpenDriveLab / End-to-end-Autonomous-Driving

YangLing0818 / RPG-DiffusionMaster

amazon-science / instruct-video-to-video

roboflow / supervision

amazon-science / object-centric-multiple-object-tracking

JIA-Lab-research / LISA

uncbiag / Awesome-Foundation-Models

ttengwang / Awesome_Long_Form_Video_Understanding

IDEA-Research / Grounded-Segment-Anything