
# Tensorflow-DPPO

A self-implemented version of DPPO (Distributed Proximal Policy Optimization) using TensorFlow.

The loss calculation follows OpenAI's PPO; the distributed architecture is inspired by DeepMind's DPPO paper.
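For reference, the PPO loss mentioned above is the clipped surrogate objective from OpenAI's PPO paper. Below is a minimal pure-Python sketch of that objective (not taken from this repository; the function and argument names are illustrative):

```python
import math

def ppo_clip_loss(old_logp, new_logp, advantages, eps=0.2):
    """Clipped surrogate loss, L = -mean(min(r*A, clip(r, 1-eps, 1+eps)*A)),
    where r = exp(new_logp - old_logp) is the probability ratio."""
    losses = []
    for lp_old, lp_new, adv in zip(old_logp, new_logp, advantages):
        ratio = math.exp(lp_new - lp_old)          # pi_new(a|s) / pi_old(a|s)
        clipped = max(min(ratio, 1.0 + eps), 1.0 - eps)
        # Negate because we minimize the loss to maximize the objective.
        losses.append(-min(ratio * adv, clipped * adv))
    return sum(losses) / len(losses)

# Unchanged policy (ratio = 1) with advantage 1 gives loss -1;
# a large ratio is clipped to 1 + eps before scaling the advantage.
print(ppo_clip_loss([0.0], [0.0], [1.0]))  # -1.0
```

In the actual TensorFlow implementation this would be expressed with tensor ops so gradients flow through the ratio; the clipping is what keeps each policy update close to the behavior policy.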

TENSORFLOW_MODEL