
Oliver Yanzuo Lu 卢彦作

PhD Student, Imperial College London

Contact: oliveryanzuolu AT gmail DOT com


Google Scholar     GitHub     LinkedIn     CV

👋 Hi, I'm Yanzuo Lu, a PhD student at Imperial College London supervised by Jiankang Deng. My current research focuses on building real-time, long-video world models and spans three areas: (1) real-time generation, where each video chunk must be generated in less time than its own playback duration; (2) long-video generation, where the key challenge is mitigating error accumulation across chunks to maintain long-term consistency; and (3) interactive world models, which call for effective control-signal injection and responsive, timely feedback.

My Chinese name is 卢彦作, and you can also call me by my English name, Oliver Lu. I love watching anime and playing video games in my spare time. My favorite anime include Demon Slayer: Kimetsu no Yaiba (鬼滅の刃), Attack on Titan (進撃の巨人), Frieren: Beyond Journey's End (葬送のフリーレン), Eighty Six (86―エイティシックス―) and Violet Evergarden (ヴァイオレット・エヴァーガーデン). Recently I've also been getting into landscape photography.

Feel free to explore my work below and reach out by email to discuss any of it.


Latest News

 
May 2026 We propose RAVEN, a training-time test framework for distilling real-time autoregressive video generation, together with CM-GRPO, which brings online RL to consistency-model sampling. The code and models are publicly released.
Oct 2025 I start my PhD at Imperial College London supervised by Jiankang Deng with a fully-funded doctoral scholarship. My research focuses on real-time and long-video world models.
Sep 2025 I leave the ByteDance Seed Team with sincere thanks to my close collaborators of nearly two years (Weifeng Chen, Huafeng Kuang, Yuxi Ren, Xin Xia, Jie Wu, Xing Wang and others), and to my team lead Xuefeng Xiao for the valuable opportunity.
Sep 2025 We migrate the acceleration recipes from Seedream models to Bagel, a unified model for understanding and generation, further refining the Hyper-SD methodology into Hyper-Bagel.
Sep 2025 As a core contributor, I work on the acceleration of Seedream 4.0, which is fully rolled out on ByteDance platforms including Doubao alongside the release of its technical report.
Jul 2025 We explore combining score distillation with adversarial training objectives, and our technical report DMDX is accepted to ICCV 2025 as a Highlight.
Jun 2025 I graduate with my MEng from Sun Yat-Sen University and become a full-time employee on the ByteDance Seed Team.
Apr 2025 I contribute to the acceleration of Seedream 3.0, which is fully rolled out on ByteDance platforms including Doubao, with its technical report released.
Sep 2024 Hyper-SD is accepted to NeurIPS 2024.
Aug 2024 I take charge of training the FLUX and SD3 acceleration LoRAs as an extension of Hyper-SD, featured again on the Hugging Face Trending list.
Jul 2024 ByteEdit, our work on accelerating diffusion-based image editing with adversarial training and reinforcement learning, on which I am co-first author, is accepted to ECCV 2024.
Apr 2024 We release Hyper-SD, a new state-of-the-art diffusion model acceleration technique trained on SD1.5 and SDXL, featured on the Hugging Face Trending list.
Apr 2024 My first work on diffusion models, CFLD for pose transfer, is accepted to CVPR 2024 as a Highlight. Both the model and code are publicly released.
Dec 2023 I join ByteDance as a research intern, working on diffusion model acceleration.

Selected Papers

 

RAVEN: Real-time Autoregressive Video Extrapolation with Consistency-model GRPO
Yanzuo Lu, Ronglai Zuo, Jiankang Deng
arXiv preprint arXiv:2605.15190, 2026
[arXiv]     [Project Page]     [Code]     [Model]

 

Seedream 4.0: Toward Next-generation Multimodal Image Generation
Yanzuo Lu (Core Contributor), ByteDance Seed
arXiv preprint arXiv:2509.20427, 2025
[arXiv]     [Project Page]

 

Seedream 3.0 Technical Report
Yanzuo Lu (Contributor), ByteDance Seed
arXiv preprint arXiv:2504.11346, 2025
[arXiv]     [Project Page]

 

Adversarial Distribution Matching for Diffusion Distillation Towards Efficient Image and Video Synthesis
Yanzuo Lu, Yuxi Ren, Xin Xia, Shanchuan Lin, Xing Wang, Xuefeng Xiao, Andy J Ma, Xiaohua Xie and Jianhuang Lai
International Conference on Computer Vision (ICCV), 2025 (Highlight)
[arXiv]     [Publication]

 

Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis
Yanzuo Lu, Manlin Zhang, Andy J Ma, Xiaohua Xie and Jianhuang Lai
Conference on Computer Vision and Pattern Recognition (CVPR), 2024 (Highlight)
[arXiv]     [Publication]     [Code]     [Talk]

 

Hyper-Bagel: A Unified Acceleration Framework for Multimodal Understanding and Generation
Yanzuo Lu, Xin Xia, Manlin Zhang, Huafeng Kuang, Jianbin Zheng, Yuxi Ren, Xuefeng Xiao
arXiv preprint arXiv:2509.18824, 2025
[arXiv]     [Project Page]

 

Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis
Yuxi Ren, Xin Xia, Yanzuo Lu, Jiacheng Zhang, Jie Wu, Pan Xie, Xing Wang and Xuefeng Xiao
Conference on Neural Information Processing Systems (NeurIPS), 2024
[arXiv]     [Publication]     [Project Page]     [Model (4M+ downloads on Hugging Face)]     [PR]

 

ByteEdit: Boost, Comply and Accelerate Generative Image Editing
Yuxi Ren, Jie Wu, Yanzuo Lu, Huafeng Kuang, Xin Xia, Xionghui Wang, Qianqian Wang, Yixing Zhu, Pan Xie, Shiyin Wang, Xuefeng Xiao, Yitong Wang, Min Zheng and Lean Fu
European Conference on Computer Vision (ECCV), 2024
[arXiv]     [Publication]     [Project Page]


Experience

Research Intern/Scientist, ByteDance Seed

Full-time: Jun 2025 - Sep 2025 · 4 mos · Shenzhen, China

Internship: Dec 2023 - Jun 2025 · 1 yr 7 mos · Shenzhen, China

Mentored by Yuxi Ren & Jie Wu and led by Xuefeng Xiao;

Research Topic: accelerating diffusion models by reducing the number of sampling steps via progressive/consistency/rectified/score/adversarial distillation and RLHF, for efficient image and video synthesis;

Industry Deployment: Douyin/TikTok (short-form content), CapCut (video editor), Dreamina (image & video generator), Doubao (chatbot)


Education

PhD Student, Imperial College London

Oct 2025 - Present, London, United Kingdom

In the Department of Computing, supervised by Jiankang Deng;

Research Topic: real-time and long-video world models;

Master of Engineering (MEng), Sun Yat-Sen University

Sep 2022 - Jun 2025, Guangzhou, China

In the School of Computer Science and Engineering;

Supervised by Andy J Ma, Xiaohua Xie and Jianhuang Lai;

Research Topic: customized diffusion models, domain adaptation and person re-identification;

Bachelor of Engineering (BEng), Sun Yat-Sen University

Sep 2018 - Jun 2022, Guangzhou, China

In the School of Computer Science and Engineering;

Relevant Coursework: Probability and Statistics, Machine Learning and Data Mining, Principles of Artificial Neural Networks, Optimization Theory, Artificial Intelligence, Computer Vision, Computer Graphics, etc. (Average Score: 91.18 / 100)


Professional Service


Awards and Honors