KimSoybean

KimSoybean

PhD student in CUHK(SZ)

61 followers · 39 following

JD AI Research
Shenzhen, China

Achievements

Stars

bytetriper / RAE

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,479 40 Updated Oct 15, 2025

zengyan-97 / X2-VLM

All-In-One VLM: Image + Video + Transfer to Other Languages / Domains (TPAMI 2023)

Python 165 14 Updated Aug 22, 2024

Qinyu-Allen-Zhao / DiSA

Official Implementation of Diffusion Step Annealing (DiSA) in Autoregressive Image Generation

Jupyter Notebook 141 1 Updated May 27, 2025

OliverRensu / xAR

This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation"

Python 238 9 Updated Oct 12, 2025

apple / ml-aim

This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.

Python 1,382 66 Updated Aug 4, 2025

layer6ai-labs / dgm-eval

Codebase for evaluation of deep generative models as presented in Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models

Jupyter Notebook 195 17 Updated Mar 3, 2025

buoyancy99 / diffusion-forcing

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 1,063 58 Updated Apr 1, 2025

YangLing0818 / consistency_flow_matching

Official Implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency"

Python 246 11 Updated Jan 17, 2025

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,955 131 Updated Oct 30, 2024

dongzhuoyao / Diffusion-Representation-Learning-Survey-Taxonomy

102 1 Updated Oct 23, 2024

foundation-multimodal-models / CAL

[NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment

Python 57 2 Updated Sep 26, 2024

minyoungg / platonic-rep

Python 626 49 Updated Apr 12, 2025

FoundationVision / VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,467 541 Updated May 18, 2025

facebookresearch / jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,256 321 Updated Feb 27, 2025

openai / weak-to-strong

Python 2,548 306 Updated May 19, 2024

Natyren / FlexPredict

Open-Source implementation of FlexPredict paper (https://arxiv.org/pdf/2308.00566.pdf)

1 Updated Oct 4, 2023

FutureXiang / ddae

[ICCV 2023 Oral] Official Implementation of "Denoising Diffusion Autoencoders are Unified Self-supervised Learners"

Python 182 8 Updated Feb 19, 2024

lucidrains / muse-maskgit-pytorch

Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch

Python 912 88 Updated Feb 29, 2024

Anima-Lab / MaskDiT

Code for Fast Training of Diffusion Models with Masked Transformers

Python 417 15 Updated May 15, 2024

facebookresearch / ijepa

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…

Python 3,100 425 Updated May 8, 2024