FlowHOI: Flow-based Semantics-Grounded Generation of Hand-Object Interactions for Dexterous Robot Manipulation
Huajian Zeng1, Lingyun Chen2, Jiaqi Yang1, Yuantai Zhang1, Fan Shi3, Peidong Liu4, Xingxing Zuo1,*
1Mohamed Bin Zayed University of Artificial Intelligence (MBZUAI), 2Technical University of Munich (TUM), 3National University of Singapore (NUS), 4Westlake University
*Corresponding author
In Submission
FlowHOI is a two-stage flow-matching framework that generates semantically grounded, temporally coherent hand-object interaction (HOI) sequences for dexterous robot manipulation.
By decoupling geometry-centric grasping from semantics-centric manipulation and conditioning on 3D Gaussian splatting scene reconstruction, FlowHOI achieves 1.7× higher physics simulation success rate and 40× inference speedup compared to the strongest diffusion-based baseline.
These capabilities enable real-robot execution on diverse dexterous manipulation tasks, bridging the gap between HOI generation and practical robotic deployment.