I am a PhD student at the Institute of Parallel and Distributed Systems (IPADS), Shanghai Jiao Tong University, advised by Prof. Haibo Chen and Prof. Mingkai Dong.
My research interests are in ML systems, AI infrastructure, and operating systems. Recently, I have been working on system support for on-device LLM inference and agentic systems.
I am also a core contributor to PowerServe, a high-performance LLM inference framework for mobile devices.
๐ Homepage / Google Scholar / X / Xiaohongshu / Zhihu / Email
- ๐ Apr 2026: Our paper Inference in the Shadows: Taming Memory Bandwidth Contention in Mobile LLM Inference with Sereno has been accepted to OSDI 2026.
- ๐ Aug 2025: Our paper SwitchFS: Asynchronous Metadata Updates for Distributed Filesystems with In-Network Coordination has been accepted to EuroSys 2026.