Skip to content

XinliYu/Reinforcement_Learning-Projects

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 

Repository files navigation

RL-Projects

This repository contains three high-quality reinforcement learning course projects.

Lunar Lander: my deep Q-learning model achieves 280+ points on average for the Lunar Lander Problem, the highest score among those we can find online and reported in the class discussion board. My paper-like report is here.

Best Model Demo

Feedback

Correlated-Q: replicates the results in Correlated-Q Learning. In addition, we demo the equilibrium evolution. For how to derive the linear programming dual, please read our paper-like report here.

CorrelatedQ Replication

SuttonMDP: replicates the results in Learning to Predict by the Methods of Temporal Differences. The same results are not easy to replicate as the paper is vague on the model's parameters. The right parmeter setup is found by repeatedly comparing the charts with the theory.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages