Python notebook divided into two sections: The first section shows the MTCS algorithm applied to a binary tree. Then it expands the study to an MTCS variant. The second section studies firstly the MonteCarlo algorithm for policy evaluation. Secondly, the implementation of a learning agent in a gridworld and then it analyzes the performance of SA…
-
Updated
Feb 7, 2021 - Jupyter Notebook