View shengming-yin's full-sized avatar


🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,172 31,060 Updated Nov 6, 2025

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,386 6,132 Updated Sep 18, 2024

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,986 3,443 Updated May 18, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,917 6,623 Updated Sep 30, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,504 6,474 Updated Nov 6, 2025

CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image

Jupyter Notebook 31,445 3,821 Updated Jul 23, 2024

Generative Models by Stability AI

Python 26,568 2,977 Updated Nov 3, 2025

Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,745 2,932 Updated Sep 2, 2024

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,270 1,760 Updated Oct 13, 2025

Official inference repo for FLUX.1 models

Python 24,601 1,808 Updated Jul 31, 2025

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch

Python 24,358 3,427 Updated Oct 28, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 23,908 2,659 Updated Aug 12, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 19,652 1,640 Updated Sep 30, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 17,558 2,186 Updated Dec 25, 2024

Let's make video diffusion practical!

Python 16,112 1,547 Updated Oct 16, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,638 2,111 Updated Jul 17, 2025

An open source implementation of CLIP.

Python 12,894 1,193 Updated Nov 4, 2025

Generate 3D objects conditioned on text or images

Python 12,125 1,048 Updated Jun 22, 2024

Official implementation of AnimateDiff.

Python 11,843 1,020 Updated Jul 31, 2024

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch

Python 11,333 1,096 Updated May 11, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 9,205 945 Updated Aug 12, 2024

Official repository for LTX-Video

Python 8,709 798 Updated Oct 25, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,469 541 Updated May 18, 2025

Python client for Baidu Yun (Personal Cloud Storage), also known as Baidu Netdisk (百度云/百度网盘)

Python 8,410 1,426 Updated Apr 2, 2025
Python 7,836 523 Updated Apr 14, 2024

Real-Time High-Resolution Background Matting

Python 7,109 966 Updated Jun 19, 2024

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,871 504 Updated May 31, 2024

Point cloud diffusion for 3D model synthesis

Python 6,816 796 Updated Jul 4, 2024