Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.
-
Updated
Mar 29, 2023 - Python
Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.
Random network distillation on Montezuma's Revenge and Super Mario Bros.
AI agents for the boardgame Splendor
Important document for research using PPO. Includes working version with neuron-astrocyte model. These are modified files from the cleanrl repository.
Add a description, image, and links to the ppo-gru topic page so that developers can more easily learn about it.
To associate your repository with the ppo-gru topic, visit your repo's landing page and select "manage topics."