Skip to content
View guozinan126's full-sized avatar

Block or report guozinan126

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 178 12 Updated Oct 28, 2024

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,543 83 Updated Nov 4, 2025

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 2,373 231 Updated Aug 28, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,113 545 Updated Nov 3, 2025

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 549 33 Updated Nov 5, 2025

Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023

Python 1,334 86 Updated Aug 10, 2023

Lets make video diffusion practical!

Python 16,080 1,545 Updated Oct 16, 2025

Optimus: the first large-scale pre-trained VAE language model

Python 391 41 Updated Sep 6, 2023

MAGI-1: Autoregressive Video Generation at Scale

Python 3,531 209 Updated Jun 17, 2025

[SIGGRAPH Asia 2025] DreamO: A Unified Framework for Image Customization

Python 1,716 129 Updated Aug 14, 2025

Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment

Python 1,447 90 Updated Sep 11, 2025

SynCD: Generating Multi-Image Synthetic Data for Text-to-Image Customization (ICCV 2025)

Python 148 13 Updated Oct 16, 2025

Subjects200K dataset

Jupyter Notebook 121 3 Updated Jan 17, 2025

[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Python 3,485 261 Updated Jul 31, 2025

Rembg is a tool to remove images background

Python 20,938 2,161 Updated Oct 25, 2025

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 6,288 403 Updated Jun 28, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 17,092 1,548 Updated Sep 5, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,372 6,131 Updated Sep 18, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,344 467 Updated Aug 7, 2024

Official inference repo for FLUX.1 models

Python 24,598 1,808 Updated Jul 31, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,234 1,120 Updated Aug 27, 2025

[ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer

Python 1,814 137 Updated Jul 3, 2025

The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)

Python 80 4 Updated Apr 23, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,494 6,470 Updated Nov 4, 2025