😶‍🌫️ Coding
  • Zhejiang U. -> Tsinghua U.
  • Shenzhen

The repository of VG-Refiner paper

Python 16 Updated Dec 9, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,341 1,450 Updated Nov 28, 2025

Train Your VAE: A VAE Training and Finetuning Script for SD/FLUX

Python 63 1 Updated Dec 3, 2025

Efficient vision foundation models for high-resolution generation and perception.

Python 3,183 229 Updated Sep 5, 2025

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 1,336 48 Updated Dec 16, 2025

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,470 71 Updated Mar 16, 2025

[ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"

Python 429 10 Updated Oct 29, 2025

Official PyTorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization"

Python 192 7 Updated Jul 23, 2023

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,644 55 Updated Nov 15, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,618 228 Updated Jun 17, 2025

Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth?

Python 141 7 Updated Feb 11, 2025

PyTorch Implementation of `No Fuss Distance Metric Learning using Proxies`

Python 184 32 Updated May 18, 2020

[NeurIPS 2025] Efficient Reasoning Vision Language Models

Python 440 29 Updated Sep 18, 2025

Video Copy Segment Localization (VCSL) dataset and benchmark [CVPR2022]

Python 131 18 Updated Feb 4, 2024

The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"

Python 71 1 Updated Oct 15, 2025

State-of-the-Art Text Embeddings

Python 18,030 2,720 Updated Dec 22, 2025
Python 147 8 Updated May 20, 2025

Griffin: Aerial-Ground Cooperative Detection and Tracking Benchmark

Python 79 7 Updated Aug 26, 2025

Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.

Python 369 17 Updated Nov 8, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,717 2,871 Updated Dec 23, 2025
Python 6,807 1,152 Updated Dec 21, 2025

Code for ICML 2025 Paper "Highly Compressed Tokenizer Can Generate Without Training"

Jupyter Notebook 195 12 Updated Jun 10, 2025

Official inference repo for FLUX.1 models

Python 24,944 1,829 Updated Jul 31, 2025

Official PyTorch implementation of FlowMo.

Jupyter Notebook 105 6 Updated Apr 7, 2025

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 1,086 59 Updated Mar 20, 2025

[SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios"

Jupyter Notebook 343 28 Updated Oct 30, 2025

High-performance Image Tokenizers for VAR and AR

Python 300 6 Updated Apr 25, 2025

Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"

Python 183 5 Updated Jun 20, 2024

The official implementation of our paper "IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis"

Python 17 Updated Apr 6, 2025

[CVPR 2025] Multiple Object Tracking as ID Prediction

Python 441 33 Updated Aug 20, 2025