DDQN

This repository contains a deep reinforcement learning agent based on a double deep Q-network (=DDQN) used for collecting food in a 3D Unity environment.
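For reference, the core idea of double Q-learning is to select the greedy next action with the online network but to evaluate it with the target network. The following is a minimal PyTorch sketch of this target computation; the names q_online and q_target and the exact tensor shapes are illustrative and not taken from this repository.

import torch

# Minimal sketch of the double DQN target computation (illustrative only; the
# names q_online and q_target are not taken from this repository).
def ddqn_targets(q_online, q_target, rewards, next_states, dones, gamma=0.99):
    with torch.no_grad():
        # The online network selects the greedy next action ...
        best_actions = q_online(next_states).argmax(dim=1, keepdim=True)
        # ... and the target network evaluates it, which reduces overestimation.
        next_q = q_target(next_states).gather(1, best_actions)
        # Target: r + gamma * Q_target(s', argmax_a Q_online(s', a)), zero at episode end.
        return rewards + gamma * next_q * (1 - dones)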

Environment

The environment is a 3D room containing bananas. It is based on the Unity engine and is provided by Udacity. The continuous states, discrete actions and rewards are given as follows:

State

  • 36 floating point values = pixels of camera attached to agent
  • 1 floating point value = forward velocity of agent

Action

  • 0 = move forward
  • 1 = move backward
  • 2 = turn left
  • 3 = turn right

Reward

  • +1 = agent collects yellow banana
  • -1 = agent collects blue banana

The environment is episodic. The return per episode, which is the non-discounted cumulative reward, is referred to as the score. The environment is considered solved if the score, averaged over the 100 most recent episodes, reaches +13.
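For illustration, a single episode and its score could be run roughly as follows, assuming the unityagents API used by the Udacity environment; the executable path and the select_action helper are placeholders, not code from this repository.

from unityagents import UnityEnvironment

# Sketch of one episode and its score, assuming the unityagents API of the
# Udacity environment; the path and select_action are placeholders.
env = UnityEnvironment(file_name="Banana_Windows_x86_64\Banana.exe")
brain_name = env.brain_names[0]

env_info = env.reset(train_mode=False)[brain_name]
state = env_info.vector_observations[0]
score = 0.0
while True:
    action = select_action(state)            # e.g. epsilon-greedy over the 4 actions
    env_info = env.step(action)[brain_name]
    state = env_info.vector_observations[0]
    score += env_info.rewards[0]             # the non-discounted cumulative reward is the score
    if env_info.local_done[0]:
        break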

Demo

The repository addresses both training and inference of the agent. The training process can be observed in a Unity window, as shown in the following video.

Training.mp4

When the training is stopped, the neural network of the agent is stored in a file called agent.pt.

The file agent.pt provided in this repository is the neural network of a successfully trained agent.
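As a rough sketch of how such a file is typically written and read with PyTorch (the class name QNetwork and its constructor arguments are assumptions, not the repository's actual code):

import torch

# Hypothetical PyTorch pattern for writing and reading agent.pt; QNetwork and its
# arguments are assumptions, not the repository's code.
torch.save(q_network.state_dict(), "agent.pt")        # store the trained network

q_network = QNetwork(state_size=37, action_size=4)    # 36 + 1 state values, 4 actions
q_network.load_state_dict(torch.load("agent.pt"))
q_network.eval()                                      # switch to inference mode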

The application of the agent to the environment, i.e. the inference process, can also be observed in a Unity window with this repository:

Inference.mp4

Installation

In order to install the project provided in this repository on Windows 10, follow these steps:

  • For Windows users: If you do not know whether you have a 64-bit operating system, you can use this help
  • Install Anaconda
  • Open the Anaconda prompt and execute the following commands:
conda create --name drlnd python=3.6
activate drlnd

git clone https://github.com/udacity/deep-reinforcement-learning.git

cd deep-reinforcement-learning/python
  • Remove torch==0.4.0 in the file requirements.txt located in the current folder .../python
  • Continue with the following commands:
pip install .
pip install keyboard
conda install pytorch=0.4.0 -c pytorch

python -m ipykernel install --user --name drlnd --display-name "drlnd"

cd ..
cd ..

git clone git@github.com:rb-rl/DDQN.git
cd DDQN
  • Download the Udacity Unity Banana environment matching your operating system:
  • Unzip the zip file into the folder DDQN. For Windows (64-bit), the Banana.exe in the zip file should end up with the relative path DDQN\Banana_Windows_x86_64\Banana.exe; for the other environments, the path is similar, but it can also be adapted as shown further below
  • For Amazon Web Services users: You have to deactivate the virtual screen and perform the training in headless mode. For inference, you have to activate the virtual screen and use the Linux Version above
  • Start a jupyter notebook with the following command:
jupyter notebook
  • Open Main.ipynb
  • In the Jupyter notebook, select Kernel -> Change Kernel -> drlnd
  • If you are not using Windows (64-bit), search for UnityEnvironment("Banana_Windows_x86_64\Banana.exe") in the notebook and update the path to the corresponding file of the environment you downloaded above

Usage

In order to run training and inference yourself, simply open Main.ipynb and execute the Jupyter Notebook cells one after another by pressing Shift+Enter.
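Conceptually, the training loop checks the solved criterion described in the Environment section. A minimal sketch of such a check is shown below; run_episode is a hypothetical helper returning one episode's score and is not part of this repository.

import numpy as np
from collections import deque

# Illustrative check of the solved criterion (+13 averaged over 100 episodes);
# run_episode is a hypothetical helper, not part of this repository.
scores_window = deque(maxlen=100)            # scores of the 100 most recent episodes
for episode in range(1, 2001):
    scores_window.append(run_episode())
    if len(scores_window) == 100 and np.mean(scores_window) >= 13.0:
        print("Solved after", episode, "episodes")
        break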
