😶‍🌫️
Coding
  • Zhejiang U. -> Tsinghua U.
  • Shenzhen

Highlights

  • Pro


📚 "Building Agents from Scratch": a from-scratch tutorial on the principles and practice of agents

Python 42,002 5,077 Updated Apr 29, 2026
Python 38 1 Updated Apr 6, 2026

FIVR-200K dataset from the "FIVR: Fine-grained Incident Video Retrieval" [TMM 2019]

Python 81 9 Updated Apr 13, 2023

Official implementation of BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning. BandPO replaces canonical clipping (PPO/GRPO) with dynamic …

Python 48 4 Updated Apr 8, 2026

My Python scripts to make high-quality figures for publications in top AI conferences and journals.

Python 904 69 Updated Apr 29, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …

Python 13,976 1,390 Updated Apr 30, 2026
Python 142 8 Updated Jul 6, 2022

[CVPR 2025] Mr. DETR: Instructive Multi-Route Training for Detection Transformers

Python 174 12 Updated Sep 6, 2025

NetworKit is a growing open-source toolkit for large-scale network analysis.

C++ 852 244 Updated Apr 24, 2026

[ICCV 2023] Official repository for the paper "Global Features are All You Need for Image Retrieval and Reranking"

Python 247 19 Updated Sep 14, 2023

The iconic SVG, font, and CSS toolkit

JavaScript 76,527 12,216 Updated Feb 10, 2026

The repository for the VG-Refiner paper

Python 19 Updated Dec 9, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 19,099 1,748 Updated Jan 30, 2026

Train Your VAE: A VAE Training and Finetuning Script for SD/FLUX

Python 80 6 Updated Apr 20, 2026

Efficient vision foundation models for high-resolution generation and perception.

Python 3,295 242 Updated Sep 5, 2025

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 1,457 56 Updated Dec 16, 2025

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,619 88 Updated Mar 16, 2025

[ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"

Python 459 13 Updated Oct 29, 2025

Official PyTorch implementation of our CVPR 2023 paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization"

Python 193 8 Updated Jul 23, 2023

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,870 80 Updated Feb 25, 2026

MAGI-1: Autoregressive Video Generation at Scale

Python 3,683 237 Updated Jun 17, 2025

Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth?

Python 149 8 Updated Feb 11, 2025

PyTorch Implementation of `No Fuss Distance Metric Learning using Proxies`

Python 184 32 Updated May 18, 2020

[NeurIPS 2025] Efficient Reasoning Vision Language Models

Python 456 30 Updated Sep 18, 2025

Video Copy Segment Localization (VCSL) dataset and benchmark [CVPR2022]

Python 138 19 Updated Feb 4, 2024

The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"

Python 92 1 Updated Oct 15, 2025

State-of-the-Art Text Embeddings

Python 18,615 2,778 Updated Apr 30, 2026
Python 170 14 Updated May 20, 2025

Griffin: Aerial-Ground Cooperative Detection and Tracking Benchmark

Python 102 9 Updated Aug 26, 2025