-
Peking University
- Shenzhen, China
-
14:44
(UTC +08:00) - https://scholar.google.com/citations?user=SYQoDk0AAAAJ&hl=zh-CN
Stars
AgentEvolver: Towards Efficient Self-Evolving Agent System
[ICLR 2026] GenCompositor: Generative Video Compositing with Diffusion Transformer
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Q-Insight Family: Q-Insight, VQ-Insight and RALI (NeurIPS 2025 Spotlight, AAAI 2026 Oral, and ICLR 2026 Oral)
[Visual Intelligence 2025] Hybrid Fourier Score Distillation for Efficient One Image to 3D Object Generation
[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"
DreamO native implementation for ComfyUI
[SIGGRAPH Asia 2025] DreamO: A Unified Framework for Image Customization
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"
Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation
[AAAI2025] DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework
[TCSVT 2024] D3C2-Net: Dual-Domain Deep Convolutional Coding Network for Compressive Sensing
WebUI extension for ControlNet
[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Official code of SmartEdit [CVPR-2024 Highlight]
Character Animation (AnimateAnyone, Face Reenactment)
Official Code for DragGAN (SIGGRAPH 2023)
NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
A Collection of Papers and Codes in CVPR2023/2022 about low level vision
pytorch structural similarity (SSIM) loss
Codes for "Metric Learning based Interactive Modulation for Real-World Super-Resolution"
Simple framework for image and video deblurring, implemented by PyTorch