Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 17,577 1,591 Updated Sep 5, 2024

OpenDriveLab / OccNet

[ICCV 2023] OccNet: Scene as Occupancy

Python 689 58 Updated Jul 2, 2025

OpenGVLab / CaFo

[CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners

Python 380 20 Updated Jun 1, 2023

sunanhe / MKT

Official implementation of "Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer".

Python 129 6 Updated Nov 7, 2024

dair-ai / Prompt-Engineering-Guide

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 74,800 8,101 Updated Mar 11, 2026

google-research / tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

30,130 2,427 Updated Jun 18, 2024

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 33,531 4,009 Updated Mar 25, 2026

akshitac8 / OW-DETR

[CVPR 2022] Official Pytorch code for OW-DETR: Open-world Detection Transformer

Python 258 46 Updated Apr 4, 2023

NVlabs / instant-ngp

Instant neural graphics primitives: lightning fast NeRF and more

Cuda 17,401 2,064 Updated Feb 2, 2026

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 36,822 5,164 Updated May 8, 2026

Oneflow-Inc / oneflow

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

C++ 9,395 1,015 Updated Dec 4, 2025

google-ai-edge / mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

C++ 35,284 5,982 Updated May 20, 2026

aosabook / 500lines

500 Lines or Less

JavaScript 29,591 5,838 Updated Aug 19, 2023

Megvii-BaseDetection / YOLOX

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

Python 10,467 2,470 Updated Jun 8, 2025

deepinsight / insightface

State-of-the-art 2D and 3D Face Analysis Project

Python 28,738 6,018 Updated Mar 29, 2026

rgushel / protobuf-converter

protobuf-converter is library for transforming your Domain Model Objects into Google Protobuf Messages and vice versa.

Java 137 43 Updated Dec 26, 2020

nvbn / thefuck

Magnificent app which corrects your previous console command.

Python 97,017 3,954 Updated Jul 19, 2024

mbadolato / iTerm2-Color-Schemes

Over 450 terminal color schemes/themes for iTerm/iTerm2. Includes ports to Terminal, Konsole, PuTTY, Xresources, XRDB, Remmina, Termite, XFCE, Tilda, FreeBSD VT, Terminator, Kitty, MobaXterm, LXTer…

Shell 26,892 6,513 Updated May 20, 2026

romkatv / powerlevel10k

A Zsh theme

Shell 54,221 2,422 Updated Mar 14, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ruri shippingwang

Achievements

Achievements

Organizations

Block or report shippingwang

Stars

apple / ml-mobileclip

ZhengPeng7 / BiRefNet

RoyalCities / RC-stable-audio-tools

siyuanliii / masa

shenyunhang / APE

hiyouga / LlamaFactory

BradyFU / Awesome-Multimodal-Large-Language-Models

jzhang38 / TinyLlama

apple / ml-ferret

cumulo-autumn / StreamDiffusion

ali-vilab / AnyDoor

IDEA-Research / Grounded-Segment-Anything