The University of Hong Kong, Hong Kong
https://peizesun.github.io/

Stars
MAGI-1: Autoregressive Video Generation at Scale
PyTorch implementation for the paper "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
[AAAI-2026] FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
This repository provides the code and model checkpoints for the AIMv1 and AIMv2 research projects.
[NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training
[CVPR 2025 Highlight] The official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss". A super memory-efficient CLIP training sc…
Code accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"
[ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
[ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
SEED-Voken: A Series of Powerful Visual Tokenizers
[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
[NeurIPS 2024 Best Paper Award] [GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
Building a quick conversation-based search demo with Lepton AI.
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, with automatic mixed precision (including fp8) and easy-to-configure FSDP and DeepSpeed support (a minimal usage sketch follows this list)
High-fidelity performance metrics for generative models in PyTorch
Open reproduction of MUSE for fast text2image generation.
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
[ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces
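Several of the repositories above are usable directly as libraries. As one illustration, tied to the 🤗 Accelerate entry, here is a minimal training-loop sketch following Accelerate's prepare/backward pattern; the model, optimizer, and data are toy placeholders of my own, not code from any repository listed above.

import torch
from accelerate import Accelerator

# Hypothetical toy setup; only the Accelerator calls come from the library.
model = torch.nn.Linear(128, 10)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
dataset = torch.utils.data.TensorDataset(
    torch.randn(256, 128), torch.randint(0, 10, (256,))
)
train_loader = torch.utils.data.DataLoader(dataset, batch_size=32)

accelerator = Accelerator()  # detects device, DDP, and mixed-precision config

# prepare() moves each object to the right device and wraps it for the
# current distributed configuration.
model, optimizer, train_loader = accelerator.prepare(model, optimizer, train_loader)

for inputs, targets in train_loader:
    optimizer.zero_grad()
    loss = torch.nn.functional.cross_entropy(model(inputs), targets)
    accelerator.backward(loss)  # replaces loss.backward(); handles grad scaling
    optimizer.step()

Run unchanged on a laptop CPU, a single GPU, or a multi-GPU node via `accelerate launch`; the same loop works in every case, which is the point of the library.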