Skip to content
View PeizeSun's full-sized avatar

Block or report PeizeSun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MAGI-1: Autoregressive Video Generation at Scale

Python 3,607 227 Updated Jun 17, 2025

Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"

Python 422 22 Updated Jun 20, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,924 125 Updated Dec 18, 2025

[AAAI-2026]FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation

Python 454 24 Updated Mar 5, 2025

[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/

Python 2,907 313 Updated Feb 19, 2025

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,052 522 Updated Jun 9, 2025

This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.

Python 1,390 68 Updated Aug 4, 2025

[NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training

Python 220 7 Updated Mar 20, 2025

[CVPR 2025 Highlight] The official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss". A super memory-efficiency CLIP training sc…

Python 274 12 Updated Jan 16, 2025

Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"

Python 36 Updated Feb 11, 2025

[ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models

Python 315 10 Updated Apr 24, 2025

Next-Token Prediction is All You Need

Python 2,266 91 Updated Nov 19, 2025

Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"

Python 633 32 Updated Oct 16, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,066 2,285 Updated Dec 25, 2024

[ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model

Python 36 Updated Nov 27, 2024

LLM101n: Let's build a Storyteller

35,901 1,962 Updated Aug 1, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 2,068 117 Updated Jul 29, 2024

SEED-Voken: A Series of Powerful Visual Tokenizers

Python 985 36 Updated Nov 25, 2025

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Python 583 45 Updated Jun 7, 2024
Python 436 44 Updated Sep 17, 2024

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,561 551 Updated Nov 10, 2025

Building a quick conversation-based search demo with Lepton AI.

TypeScript 8,127 1,023 Updated Dec 2, 2025

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 6,167 568 Updated Aug 22, 2025

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 9,394 1,247 Updated Dec 17, 2025

High-fidelity performance metrics for generative models in PyTorch

Python 1,156 85 Updated Nov 18, 2025

Open reproduction of MUSE for fast text2image generation.

Python 359 31 Updated Jun 1, 2024

Compute FID scores with PyTorch.

Python 3,815 524 Updated Jul 3, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 8,178 736 Updated May 31, 2024
Python 635 33 Updated Feb 15, 2024

[ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces

Python 236 15 Updated Feb 14, 2025
Next