The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 53,383 6,229 Updated Sep 18, 2024

TencentARC / GFPGAN

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Python 37,382 6,280 Updated Jul 26, 2024

google-research / google-research

Google Research

Jupyter Notebook 37,235 8,327 Updated Feb 6, 2026

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 36,335 5,117 Updated Feb 6, 2026

XingangPan / DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,976 3,441 Updated May 18, 2024

lllyasviel / ControlNet

Let us control diffusion models!

Python 33,624 3,000 Updated Feb 25, 2024

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 32,718 6,750 Updated Feb 8, 2026

Lightning-AI / pytorch-lightning

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python 30,811 3,669 Updated Feb 4, 2026

s0md3v / roop

one-click face swap

Python 30,508 6,911 Updated Aug 19, 2024

iperov / DeepFaceLive

Real-time face swap for PC streaming or video calls

Python 30,485 1,111 Updated Nov 8, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 28,502 2,885 Updated Apr 30, 2025

Stability-AI / generative-models

Generative Models by Stability AI

Python 26,894 3,036 Updated Dec 16, 2025

facefusion / facefusion

Industry leading face manipulation platform

Python 26,674 4,282 Updated Feb 7, 2026

OpenBMB / MiniCPM-o

A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone

Python 23,271 1,772 Updated Feb 8, 2026

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 22,977 2,568 Updated Mar 13, 2025

serengil / deepface

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 22,155 3,027 Updated Jan 25, 2026

amusi / CVPR2025-Papers-with-Code

CVPR 2025 论文和开源项目合集

21,850 2,779 Updated Jul 2, 2025

PicoTrex / Awesome-Nano-Banana-images

A curated collection of fun and creative examples generated with Nano Banana & Nano Banana Pro🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the commu…

20,715 2,145 Updated Dec 12, 2025

index-tts / index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 18,603 2,298 Updated Dec 2, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 18,186 1,404 Updated Feb 7, 2026

sczhou / CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Python 17,796 3,697 Updated Nov 18, 2025

eriklindernoren / PyTorch-GAN

PyTorch implementations of Generative Adversarial Networks.

Python 17,426 4,099 Updated Jun 18, 2024

IDEA-Research / Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 17,386 1,572 Updated Sep 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

yongzhang6782 yzhang2016

Achievements

Achievements

Block or report yzhang2016

Stars

TheAlgorithms / Python

AUTOMATIC1111 / stable-diffusion-webui

rasbt / LLMs-from-scratch

CompVis / stable-diffusion

openai / openai-cookbook

FoundationAgents / MetaGPT

openai / codex

facebookresearch / segment-anything