Deep Reinforcement Learning Nanodegree: Project 2 - Continuous Control

This project includes the code for an implementation of the deep deterministic policy gradient(DDPG) algorithm which I wrote to solve the Project 2 - Continuous Control of the Deep Reinforcement Learning Nanodegree @ Udacity. My version of the DDPG algorithm is inspired by chapter 14 of Maxim Lapan’s book called "Deep Reinforcement Learning Hands-On".

For more information on the implemented features refer to "Report.ipynb". The notebook includes a summary of all essential concepts used in the code.

Project 2 - Continuous Control - Details:

The goal of this project was to train an agent, represented by a double-jointed arm, to maintain its position at the target location(great green sphere) for as many time steps as possible.

Random Agent

Trained Agent

Reward:

a reward of +0.1 is provided for each step that the agent's hand is in the goal location

Search Space

the state space has 33 dimensions
- corresponding to the position, rotation, velocity, and angular velocities of the arm
the action space has four dimensions
- every action is a continuous number between -1 and 1

Task

the task is episodic
the agent has to maintain its position at the target location(great green sphere) for as many time steps as possible
to solve the environment, the agent must get an average score of +30 over 100 consecutive episodes

Getting Started

Create (and activate) a new environment with Python 3.6.

conda create --name env_name python=3.6
source activate env_name

Download the environment from one of the links below and place it into \p2_continuous-control\Reacher_One_Linux

Linux: click here
Mac OSX: click here
Windows (32-bit): click here
Windows (64-bit): click here
your folder should now look something like this:

\Reacher_One_Linux
     \Reacher_Data
     \Reacher.x86
     \Reacher.x86_64

Install Sourcecode dependencies

conda install -c pytorch pytorch
conda install -c anaconda numpy
pip install tensorboardX

unityagents is also required
- an easy way to get this is to install the Deep Reinforcement Learning Nanodegree with its dependencies

git clone https://github.com/udacity/deep-reinforcement-learning.git
cd deep-reinforcement-learning/python
pip install .

How to run the project

You can run the project by running the main.py file through the console.

open the console and run: python main.py -c "your_config_file.json"
to train the agent from scratch set "run_training" in the config file to true
to run the pre-trained agent set "run_training" in the config file to false

optional arguments:

-h, --help

- show help message

-c , --config

- Config file name - file must be available as .json in ./configs

Example: python main.py -c "reacher_one.json"

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
checkpoints/Reacher_One_Linux		checkpoints/Reacher_One_Linux
configs		configs
images		images
monitor/Reacher_One_Linux/2018_10_09__19_24_58		monitor/Reacher_One_Linux/2018_10_09__19_24_58
README.md		README.md
Report.ipynb		Report.ipynb
agent.py		agent.py
experience.py		experience.py
main.py		main.py
network.py		network.py
noise.py		noise.py
protocol.py		protocol.py
session.py		session.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep Reinforcement Learning Nanodegree: Project 2 - Continuous Control

Project 2 - Continuous Control - Details:

Random Agent

Trained Agent

Reward:

Search Space

Task

Getting Started

How to run the project

About

Releases

Packages

Languages

cpow-89/Deep_Reinforcement_Learning_Nanodegree_Project_2_Continuous_Control

Folders and files

Latest commit

History

Repository files navigation

Deep Reinforcement Learning Nanodegree: Project 2 - Continuous Control

Project 2 - Continuous Control - Details:

Random Agent

Trained Agent

Reward:

Search Space

Task

Getting Started

How to run the project

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages