Jie Liu (刘杰)
jieliu [at] link [dot] cuhk [dot] edu [dot] hk
Hello! I'm currently a third-year Ph.D. student at MMLab, The Chinese University of Hong Kong, supervised by Prof. Wanli Ouyang. My research primarily focuses on Reinforcement Learning, Generative Models, and Large Language Models (LLMs).
News
| 🥳2025.5: | Flow-GRPO and VideoAlign are accepted at NeurIPS 2025! |
|---|---|
| 🥳2025.5: | We release Flow-GRPO, the first method integrating online RL into flow matching models! |
| 2025.2: | We release VideoAlign, a systematic pipeline that harnesses human feedback to improve video generation! |
| 2024.8: | Our paper Emulated Disalignment won the Outstanding Paper Award at ACL 2024! |
| 2024.5: | Four Papers on Large Language Models are accepted at ACL 2024! |
| 2023.12: | One Paper on Offline-to-Online RL (SO2) is accepted at AAAI 2024! |
| 2023.10: | We release MODPO, a multi-objective direct preference optimization algorithm for language models! |
| 2023.10: | We release MaskMA, a masked pretraining framework for multi-agent decision-making! |
| 2023.08: | Became a Ph.D. student at MMLab, The Chinese University of Hong Kong. |
| 2023.04: | One Paper on Autonomous Driving (ASAP-RL) is accepted at RSS 2023! |
| 2022.11: | One Paper on Multi-agent RL (ACE) is accepted at AAAI 2023! |
| 2021.03: | Inception Convolution is accepted at CVPR 2021 as an oral paper! |
Selected Publications
- Arxiv: Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level. Arxiv, 2024
- NN
- TMLR: Masked Pretraining for Multi-Agent Decision Making. Transactions on Machine Learning Research, 2024
- ECAI: Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning. Proceedings of the European Conference on Artificial Intelligence (ECAI), 2023
- Arxiv
Academic Service
- Conference Reviewer
NeurIPS 2022, ICML 2023, NeurIPS 2023, ICLR 2024, CVPR 2024, ICML 2024, NeurIPS 2024, ICLR 2024, ICML 2025