- Peking University
- Shenzhen, China (UTC +08:00)
- https://scholar.google.com/citations?user=SYQoDk0AAAAJ&hl=zh-CN
Stars
AgentEvolver: Towards Efficient Self-Evolving Agent System
[ICLR 2026] GenCompositor: Generative Video Compositing with Diffusion Transformer
Q-Insight Family: Q-Insight, VQ-Insight and RALI (NeurIPS 2025 Spotlight, AAAI 2026 Oral, and ICLR 2026 Oral)
DreamO native implementation for ComfyUI
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
[SIGGRAPH Asia 2025] DreamO: A Unified Framework for Image Customization
Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"
Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation
[Visual Intelligence 2025] Hybrid Fourier Score Distillation for Efficient One Image to 3D Object Generation
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
[TCSVT 2024] D3C2-Net: Dual-Domain Deep Convolutional Coding Network for Compressive Sensing
[AAAI 2025] DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework
Character Animation (AnimateAnyone, Face Reenactment)
Official code of SmartEdit [CVPR-2024 Highlight]
Official PyTorch implementation for the paper "AnimateZero: Video Diffusion Models are Zero-Shot Image Animators"
[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"
[ICLR 2024] GitHub Repo for "HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion"
WebUI extension for ControlNet
Official Code for DragGAN (SIGGRAPH 2023)
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Optimization-Inspired Cross-Attention Transformer for Compressive Sensing (CVPR 2023)
Official code for "AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation" (CVPR 2023)