Skip to content
View KimSoybean's full-sized avatar
  • JD AI Research
  • Shenzhen, China

Block or report KimSoybean

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,479 40 Updated Oct 15, 2025

All-In-One VLM: Image + Video + Transfer to Other Languages / Domains (TPAMI 2023)

Python 165 14 Updated Aug 22, 2024

Official Implementation of Diffusion Step Annealing (DiSA) in Autoregressive Image Generation

Jupyter Notebook 141 1 Updated May 27, 2025

This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation"

Python 238 9 Updated Oct 12, 2025

This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.

Python 1,382 66 Updated Aug 4, 2025

Codebase for evaluation of deep generative models as presented in Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models

Jupyter Notebook 195 17 Updated Mar 3, 2025

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 1,063 58 Updated Apr 1, 2025

Official Implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency"

Python 246 11 Updated Jan 17, 2025

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,955 131 Updated Oct 30, 2024

[NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment

Python 57 2 Updated Sep 26, 2024
Python 626 49 Updated Apr 12, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,467 541 Updated May 18, 2025

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,256 321 Updated Feb 27, 2025
Python 2,548 306 Updated May 19, 2024

Open-Source implementation of FlexPredict paper (https://arxiv.org/pdf/2308.00566.pdf)

1 Updated Oct 4, 2023

[ICCV 2023 Oral] Official Implementation of "Denoising Diffusion Autoencoders are Unified Self-supervised Learners"

Python 182 8 Updated Feb 19, 2024

Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch

Python 912 88 Updated Feb 29, 2024

Code for Fast Training of Diffusion Models with Masked Transformers

Python 417 15 Updated May 15, 2024

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…

Python 3,100 425 Updated May 8, 2024

understanding model mistakes with human annotations

Jupyter Notebook 106 6 Updated Feb 22, 2023

A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He el al.).

857 54 Updated Jul 10, 2024

This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"

Python 198 21 Updated Jan 11, 2023
Python 61 7 Updated Sep 13, 2023

Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)

Python 461 34 Updated May 9, 2022

Pytorch Repo for DeepGCNs (ICCV'2019 Oral, TPAMI'2021), DeeperGCN (arXiv'2020) and GNN1000(ICML'2021): https://www.deepgcns.org

Python 1,181 156 Updated Jul 31, 2022

This is a offical PyTorch/GPU implementation of SupMAE.

Jupyter Notebook 79 4 Updated Aug 30, 2022

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

Python 1,503 227 Updated Apr 3, 2024
Next