Zhejiang U. -> Tsinghua U.
Shenzhen
Stars
Qwen3-VL is the multimodal large language model series developed by the Qwen team at Alibaba Cloud.
Train Your VAE: A VAE Training and Finetuning Script for SD/FLUX
Efficient vision foundation models for high-resolution generation and perception.
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
[ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"
Official PyTorch implementation of our CVPR 2023 paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization"
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
MAGI-1: Autoregressive Video Generation at Scale
Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth?
PyTorch implementation of "No Fuss Distance Metric Learning using Proxies"
[NeurIPS 2025] Efficient Reasoning Vision Language Models
Video Copy Segment Localization (VCSL) dataset and benchmark [CVPR 2022]
The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"
State-of-the-Art Text Embeddings
Griffin: Aerial-Ground Cooperative Detection and Tracking Benchmark
Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.
verl: Volcano Engine Reinforcement Learning for LLMs
Code for ICML 2025 Paper "Highly Compressed Tokenizer Can Generate Without Training"
Official inference repo for FLUX.1 models
Official PyTorch implementation of FlowMo.
This repo contains the code for the 1D tokenizer and generator
[SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios"
High-performance Image Tokenizers for VAR and AR
Implementation of TiTok, proposed by ByteDance in "An Image is Worth 32 Tokens for Reconstruction and Generation"
The official implementation of our paper "IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis"
[CVPR 2025] Multiple Object Tracking as ID Prediction