Junke Wang wdrink

Ph.D. student from Fudan University, working on multimodal intelligence.

146 followers · 35 following

Fudan University
Shanghai
https://wdrink.github.io/

Pinned

SimpleAR Public

PyTorch implementation of the paper "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"

Python · 428 stars · 25 forks
FoundationVision/OmniTokenizer Public

[NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization.

Python · 324 stars · 8 forks
X2FD/LVIS-INSTRUCT4V Public

134 stars
ARM Public

ARM: An AutoRegressive Large Multimodal Model with Discrete Representations

44 stars
RepWAM Public

31 stars · 1 fork
M2TR-Multi-modal-Multi-scale-Transformers-for-Deepfake-Detection Public

Python · 120 stars · 14 forks