Stars
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Qwen3-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.
PyTorch code and models for the DINOv2 self-supervised learning method.
A unified framework for 3D content generation.
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
A Modular Framework for 3D Gaussian Splatting and Beyond
Simple code for generating a color-coded LaTeX table from raw data