KimSoybean

PhD student at CUHK(SZ)

61 followers · 39 following

JD AI Research
Shenzhen, China

Stars

170 results for source starred repositories

bytetriper / RAE

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python · 1,479 stars · 40 forks · Updated Oct 15, 2025

zengyan-97 / X2-VLM

All-In-One VLM: Image + Video + Transfer to Other Languages / Domains (TPAMI 2023)

Python · 165 stars · 14 forks · Updated Aug 22, 2024

Qinyu-Allen-Zhao / DiSA

Official Implementation of Diffusion Step Annealing (DiSA) in Autoregressive Image Generation

Jupyter Notebook · 141 stars · 1 fork · Updated May 27, 2025

OliverRensu / xAR

This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation"

Python · 238 stars · 9 forks · Updated Oct 12, 2025

apple / ml-aim

This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.

Python · 1,382 stars · 66 forks · Updated Aug 4, 2025

layer6ai-labs / dgm-eval

Codebase for evaluation of deep generative models as presented in "Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models"

Jupyter Notebook · 195 stars · 17 forks · Updated Mar 3, 2025

buoyancy99 / diffusion-forcing

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python · 1,064 stars · 58 forks · Updated Apr 1, 2025

YangLing0818 / consistency_flow_matching

Official Implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency"

Python · 246 stars · 11 forks · Updated Jan 17, 2025

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python · 1,955 stars · 131 forks · Updated Oct 30, 2024

dongzhuoyao / Diffusion-Representation-Learning-Survey-Taxonomy

102 stars · 1 fork · Updated Oct 23, 2024

foundation-multimodal-models / CAL

[NeurIPS'24] Official PyTorch Implementation of "Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment"

Python · 57 stars · 2 forks · Updated Sep 26, 2024

minyoungg / platonic-rep

Python · 627 stars · 49 forks · Updated Apr 12, 2025

FoundationVision / VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook · 8,467 stars · 541 forks · Updated May 18, 2025