Deep Reinforcement Learning

==================================

$ cd DRL
$ pip install -e .
$ pip install ~/carla/PythonClient  # optional
$ pip install opencv-python

==================================

references:

sac is based on:

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

sac1 is added based on:

Soft Actor-Critic Algorithms and Applications
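
As background on what the second paper adds over the first: "Soft Actor-Critic Algorithms and Applications" proposes tuning the entropy temperature alpha automatically, by minimizing J(alpha) = E[-alpha * (log pi(a|s) + target_entropy)]. The Python sketch below illustrates that objective only. It is not taken from this repository, and whether sac1 implements it this way is an assumption. The function name, the log-alpha parameterization (used to keep alpha positive), the learning rate, and the sample values are all hypothetical.

```python
import math


def temperature_loss_and_grad(log_alpha, log_probs, target_entropy):
    """Loss J(alpha) and its gradient w.r.t. log_alpha (hypothetical helper).

    log_probs are log pi(a|s) values for actions sampled from the policy.
    """
    alpha = math.exp(log_alpha)
    # Batch estimate of E[log pi(a|s) + target_entropy].
    mean_term = sum(lp + target_entropy for lp in log_probs) / len(log_probs)
    loss = -alpha * mean_term
    # d(alpha)/d(log_alpha) = alpha, so the gradient has the same value as the loss.
    grad = -alpha * mean_term
    return loss, grad


# Assumed example: 2-dimensional action space, using the paper's heuristic
# target entropy of -dim(A).
target_entropy = -2.0
log_probs = [-0.5, -1.0, -1.5]  # made-up sample values
log_alpha = 0.0                 # alpha = 1.0

loss, grad = temperature_loss_and_grad(log_alpha, log_probs, target_entropy)
new_log_alpha = log_alpha - 0.1 * grad  # one plain gradient-descent step
print(loss, grad, new_log_alpha)
```

In this example the estimated policy entropy (1.0) is above the target (-2.0), so the gradient step lowers alpha and weakens the entropy bonus.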

==================================

ddpg vs sac1

sqn experiments on gym env 'LunarLander-v2':

Try the trained model on gym env 'Breakout-ram-v4':

$ python -m spinup.run test_policy ./saved_models/Breakout-ram-v4 -d -l 20000
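
In upstream OpenAI Spinning Up, the `test_policy` utility takes `-d` to request deterministic actions and `-l` to cap the episode length (20000 steps in the command above). Assuming this repository's `spinup.run` keeps those meanings, the Python sketch below illustrates what such an evaluation rollout does. `CountdownEnv`, `run_episode`, and the constant policy are hypothetical stand-ins, not code from this repository. The toy environment only mimics a `reset()`/`step()` interface so the example runs without gym.

```python
class CountdownEnv:
    """Hypothetical toy environment: the episode ends after `horizon` steps
    and every step pays a reward of 1.0."""

    def __init__(self, horizon=50):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t  # the observation is simply the step counter

    def step(self, action):
        self.t += 1
        done = self.t >= self.horizon
        return self.t, 1.0, done, {}


def run_episode(env, policy, max_ep_len):
    """Roll out one episode, stopping at `done` or after `max_ep_len` steps."""
    obs = env.reset()
    ep_ret, ep_len = 0.0, 0
    done = False
    while not done and ep_len < max_ep_len:
        obs, reward, done, _ = env.step(policy(obs))
        ep_ret += reward
        ep_len += 1
    return ep_ret, ep_len


# A deterministic policy maps the same observation to the same action every time.
policy = lambda obs: 0

print(run_episode(CountdownEnv(horizon=50), policy, max_ep_len=20000))  # (50.0, 50)
print(run_episode(CountdownEnv(horizon=50), policy, max_ep_len=20))     # (20.0, 20)
```

With a generous cap the episode ends when the environment signals `done`. With a small cap the rollout is cut off early, which is the role the length limit plays during evaluation.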

Learning Latent Dynamics for Planning from Pixels

InfoBot: Transfer and Exploration via the Information Bottleneck

Unsupervised Meta-Learning for Reinforcement Learning

Diversity is All You Need: Learning Skills without a Reward Function (DIAYN)

Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks (MAML)
