shengming-yin

ShengmingYin shengming-yin

Achievements

Highlights

Stars

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,172 31,060 Updated Nov 6, 2025

facebookresearch / segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,386 6,132 Updated Sep 18, 2024

XingangPan / DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,986 3,443 Updated May 18, 2024

chenfei-wu / TaskMatrix

Python 34,354 3,272 Updated Jan 6, 2024

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,917 6,623 Updated Sep 30, 2025

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,504 6,474 Updated Nov 6, 2025

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 31,445 3,821 Updated Jul 23, 2024

Stability-AI / generative-models

Generative Models by Stability AI

Python 26,568 2,977 Updated Nov 3, 2025

Vision-CAIR / MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,745 2,932 Updated Sep 2, 2024

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,270 1,760 Updated Oct 13, 2025

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 24,601 1,808 Updated Jul 31, 2025

lucidrains / vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 24,358 3,427 Updated Oct 28, 2025

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 23,908 2,659 Updated Aug 12, 2024

QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 19,652 1,640 Updated Sep 30, 2025

facebookresearch / sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 17,558 2,186 Updated Dec 25, 2024

lllyasviel / FramePack

Lets make video diffusion practical!

Python 16,112 1,547 Updated Oct 16, 2025

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,638 2,111 Updated Jul 17, 2025

mlfoundations / open_clip

An open source implementation of CLIP.

Python 12,894 1,193 Updated Nov 4, 2025

openai / shap-e

Generate 3D objects conditioned on text or images

Python 12,125 1,048 Updated Jun 22, 2024

guoyww / AnimateDiff

Official implementation of AnimateDiff.

Python 11,843 1,020 Updated Jul 31, 2024

lucidrains / DALLE2-pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

Python 11,333 1,096 Updated May 11, 2024

IDEA-Research / GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 9,205 945 Updated Aug 12, 2024

Lightricks / LTX-Video

Official repository for LTX-Video

Python 8,709 798 Updated Oct 25, 2025

FoundationVision / VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,469 541 Updated May 18, 2025

houtianze / bypy

Python client for Baidu Yun (Personal Cloud Storage) 百度云/百度网盘Python客户端

Python 8,410 1,426 Updated Apr 2, 2025

deep-floyd / IF

Python 7,836 523 Updated Apr 14, 2024

PeterL1n / BackgroundMattingV2

Real-Time High-Resolution Background Matting

Python 7,109 966 Updated Jun 19, 2024

gaomingqi / Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,871 504 Updated May 31, 2024

timothybrooks / instruct-pix2pix

Python 6,830 574 Updated Mar 3, 2024

openai / point-e

Point cloud diffusion for 3D model synthesis

Python 6,816 796 Updated Jul 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ShengmingYin shengming-yin

Achievements

Achievements

Highlights

Block or report shengming-yin

Stars

huggingface / transformers

facebookresearch / segment-anything

XingangPan / DragGAN

chenfei-wu / TaskMatrix

facebookresearch / fairseq

huggingface / diffusers

openai / CLIP

Stability-AI / generative-models

Vision-CAIR / MiniGPT-4

QwenLM / Qwen3

black-forest-labs / flux

lucidrains / vit-pytorch

haotian-liu / LLaVA

QwenLM / Qwen

facebookresearch / sam2

lllyasviel / FramePack

Wan-Video / Wan2.1

mlfoundations / open_clip

openai / shap-e

guoyww / AnimateDiff

lucidrains / DALLE2-pytorch

IDEA-Research / GroundingDINO

Lightricks / LTX-Video

FoundationVision / VAR

houtianze / bypy

deep-floyd / IF

PeterL1n / BackgroundMattingV2

gaomingqi / Track-Anything

timothybrooks / instruct-pix2pix

openai / point-e