- Ph.D. student in Machine Learning and Control at the Hybrid Robotics Lab, BAIR, UC Berkeley
- Working on Reinforcement Learning and Stochastic Optimal Control
Robust and Safe RL, Stochastic Optimal Control, Data-Efficient RL, Scalable RL, Skill Discovery and Search
Can an agent's performance increase monotonically, almost surely, given any data stream?
Offline RL, Off2On RL, Off-Policy Q-Learning, Self/Unsupervised RL, SOC for Finance, Dynamical Systems