PhD Student in AI at the University of Oxford. Multi-agent RL and other explorations.
-
University of Oxford
Stars
Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.
Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).
hanabi_learning_environment is a research platform for Hanabi experiments.
Code release for Learning with Opponent-Learning Awareness and variations.
Exploring the Input Switch Affine Networks model
From Pixels to Torques: Policy Learning using Deep Dynamical Convolutional Neural Networks (DDCNN)