Skip to content

Latest commit

 

History

History
53 lines (28 loc) · 898 Bytes

README.md

File metadata and controls

53 lines (28 loc) · 898 Bytes

Implementation of A3C (Asynchronous Advantage Actor-Critic)

This is a tensorflow implementation of Asynchronous advantage actor-critic algorithm for CNN-LSTM as function approximator

Original Paper

here

Demo

Breakout_v0

Results

Training on Breakout-v0 is done with Nvidia GeForce GTX 1070 GPU for 28 hours

Total Scores Vs Number of iteration (Breakout_v0)

Scores

Episode Length Vs Number of iteration (Breakout_v0)

Episode_Length

Dependencies

  • python 3.5
  • tensorflow 1.1.0
  • opencv 3.2.0
  • openAI

Usage

For Training Run:

$ python3 trainer.py

For Demo Run:

$ python3 play.py

Credit

Got important help form this project