Stars
ControlArena is a collection of settings, model organisms and protocols - for running control experiments.
Code accompanying the paper "Interactively Learning Preference Constraints in Linear Bandits" (ICML 2022).
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Massively parallel rigidbody physics simulation on accelerator hardware.
Code accompanying "Learning What To Do by Simulating the Past", ICLR 2021.
The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as this has been done by OpenAI and provides a good benchmark to …
General purpose environment wrappers for openai gym
Simple example showing how to integrate Ray parallelization with the Sacred experiment framework
A set of high-dimensional continuous control environments for use with Unity ML-Agents Toolkit.
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …
Training (hopefully) safe agents in gridworlds
Simple gridworlds for AI safety research, implemented with the OpenAI gym interface.
An educational resource to help anyone learn deep reinforcement learning.
Joplin - the privacy-focused note taking app with sync capabilities for Windows, macOS, Linux, Android and iOS.
andrewschreiber / agent
Forked from tensorflow/tensorboardInterpretability dashboard for reinforcement learners
⏰ AI conference deadline countdowns
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms