My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow.
reinforcement-learning tensorflow lstm dqn rl rnd a3c per ddqn distributed-tensorflow ppo dppo random-network-distillation dueling-ddqn n-step rnd-ppo n-step-target n-step-return
-
Updated
Mar 24, 2023 - Jupyter Notebook