Stars
A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementations with an aim to improve accessibility in RL
Repo for reproduction of sequential social dilemmas
CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control
Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's MuJoCo Gym environments.
Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch
Real-World RL Benchmark Suite
Agent Learning Framework https://alf.readthedocs.io
Lightweight Nearest Neighbors with Flexible Backends
Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"
A library for ready-made reinforcement learning agents and reusable components for neat prototyping
Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in PyTorch
[ICLR-2025] POGEMA stands for Partially-Observable Grid Environment for Multiple Agents. This is a grid-based environment that was specifically designed to be flexible, tunable and scalable. It can…
Code for ICLR 2019 paper: Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks
Load TensorBoard event logs as pandas DataFrames for scientific plotting; supports both PyTorch and TensorFlow
Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)
A library that makes Evolutionary Strategies (ES) simple to use.
Official code for ICML 2022: Mitigating Neural Network Overconfidence with Logit Normalization
My solution to the Unity Obstacle Tower Challenge
This project downloads and stores the daily SBI forex rates in a CSV file, giving you easy access to historical rates.
JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
Code accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)
Fast Flexible Replay Buffer Library (Mirror repository of https://gitlab.com/ymd_h/cpprb)