Advantage-Weighted Regression (AWR)

Code accompanying the paper: "Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning". The framework provides an implementation of AWR and supports running experiments on standard OpenAI Gym environments.

Project page: https://xbpeng.github.io/projects/AWR/index.html

Getting Started

Install requirements:

pip install -r requirements.txt

The code should then be ready to run.

Training Models

To train a policy, run the following command:

python run.py --env HalfCheetah-v2 --max_iter 20000 --visualize

HalfCheetah-v2 can be replaced with other environments.
--max_iter specifies the maximum number of training iterations.
--visualize enables visualization, and rendering can be disabled by removing the flag.
The log and model are saved to the output/ directory by default. A different directory can be specified with --output_dir [output-directory].
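
To queue several training runs, the command above can be assembled programmatically. The following is a minimal sketch and is not part of the repository: the helper name build_train_command, the second environment id, and the per-environment output paths are assumptions chosen for illustration. Only the flags documented above are used.

```python
import shlex


def build_train_command(env, max_iter=20000, visualize=False, output_dir=None):
    """Assemble the run.py training command from the documented flags.

    Hypothetical helper: it only builds the argument list, it does not
    start training.
    """
    cmd = ["python", "run.py", "--env", env, "--max_iter", str(max_iter)]
    if visualize:
        cmd.append("--visualize")
    if output_dir is not None:
        cmd += ["--output_dir", output_dir]
    return cmd


if __name__ == "__main__":
    # Illustrative list of environments; replace with the ones you need.
    for env in ["HalfCheetah-v2", "Hopper-v2"]:
        cmd = build_train_command(env, output_dir="output/" + env)
        # Print the shell-quoted command for inspection.
        print(shlex.join(cmd))
        # To actually launch it: subprocess.run(cmd, check=True)
```

Printing the commands first makes it easy to check them before starting long training runs.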

Loading Models

To load a trained model, run the following command:

python run.py --test --env HalfCheetah-v2 --model_file data/policies/halfcheetah_awr.ckpt --visualize

--model_file specifies the .ckpt file that contains the trained model. Pretrained models are available in data/policies/.

Code

learning/rl_agent.py is the base agent class and implements basic RL functionality.
learning/awr_agent.py implements the AWR algorithm. The _update() method performs one update iteration.
awr_configs.py can be used to specify hyperparameters for the different environments. If no configuration is specified for a particular environment, then the algorithm will use the default hyperparameter settings in learning/awr_agent.py.
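
The fallback from per-environment settings to defaults can be pictured with a short sketch. This is an illustration of the pattern only, not the contents of awr_configs.py: the dictionary names, the hyperparameter names, and the values below are all hypothetical.

```python
# Hypothetical defaults (names and values are illustrative, not the repository's).
DEFAULT_HYPERPARAMS = {
    "learning_rate": 1e-4,
    "batch_size": 256,
}

# Hypothetical per-environment overrides; environments not listed use the defaults.
ENV_OVERRIDES = {
    "HalfCheetah-v2": {"learning_rate": 5e-5},
}


def resolve_hyperparams(env_id):
    """Return the defaults, updated with any overrides registered for env_id."""
    params = dict(DEFAULT_HYPERPARAMS)  # copy so the defaults are never mutated
    params.update(ENV_OVERRIDES.get(env_id, {}))
    return params
```

With this layout, an environment without an entry in the overrides simply receives an unmodified copy of the defaults.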

Data

data/policies/ contains pretrained models for the different environments.
data/logs/ contains training logs for the different environments, which can be used to plot learning curves.
