jakobnicolaus

Jakob Foerster jakobnicolaus

PhD Student in AI at the University of Oxford. Multi-agent RL and other explorations.

Achievements

Highly scalable 2D JAX physics engine.

Python 65 8 Updated Feb 20, 2026

Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.

Python 242 10 Updated Feb 26, 2026

Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).

Jupyter Notebook 21 2 Updated Jan 15, 2020

hanabi_learning_environment is a research platform for Hanabi experiments.

Python 666 162 Updated Feb 14, 2023

A list of Hanabi strategies

TypeScript 180 191 Updated Apr 16, 2026

Code release for Learning with Opponent-Learning Awareness and variations.

Jupyter Notebook 151 38 Updated Apr 13, 2023

Exploring the Input Switch Affine Networks model

Jupyter Notebook 5 5 Updated Jan 18, 2017

From Pixels to Torques: Policy Learning using Deep Dynamical Convolutional Neural Networks (DDCNN)

Jupyter Notebook 42 5 Updated Nov 3, 2016