Learning to Run NIPS 2017 Competition
-
Updated
Aug 18, 2017 - Python
Learning to Run NIPS 2017 Competition
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
Implementing reinforcement-learning algorithms for pysc2 -environment
Attempt to implement A2C and PPO algorithm with modular properties of Maxout and LWTA. # UNFINISHED AND FAILED
Implementation of proximal policy optimization(PPO) with tensorflow
Proximal Policy Optimization with Stein Control Variates:
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR). Python2 compatible (branch python2)
Simple reinforcement learning framework for selfplay experiments
RLbox: Solving OpenAI Gym with TensorFlow
simple and compact implementations of reinforcement learning benchmark algorithms
Reinforcement learning library for PyTorch.
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).
小时候练手的rl项目
Minimal implementation of PPO, running in Mujoco env, using Gym-mujoco
Generative-Adversarial-Imitation-Learning on PySC2
Implementation of Proximal Policy Optimization(PPO)
PyTorch Implementations of Standard Deep RL Algorithms (including REINFORCE, A2C, PPO)
Add a description, image, and links to the ppo topic page so that developers can more easily learn about it.
To associate your repository with the ppo topic, visit your repo's landing page and select "manage topics."