[NeurIPS 2025] Flow x RL. Official Implementation of "ReinFlow: Fine-tuning Flow Policy with Online Reinforcement Learning".
flow robotics rl manipulation locomotion robot-learning humanoid fine-tuning post-training actorcritic policygradient decisionmaking finetuning-rl visuomotor finetuning-vision-models flowmatching onlinerl flowmodel flowpolicy
Updated Sep 26, 2025 · Python