Value Iteration Algorithm for Reinforcement Learning

An implementation of the value iteration algorithm based on "Reinforcement Learning: An Introduction (Second edition)." Here, we iterate the Q table.

We are using the settings similar to the settings for Figure 17.1 on page 646 in the Artificial Intelligence: A Modern Approach (Third Edition) (But not the same!).

“A simple 4 × 3 environment that presents the agent with a sequential decision problem.”

“The "intended" outcome occurs with probability 0.8, but with probability 0.2 the agent moves at right angles to the intended direction. A collision with a wall results in no movement. The two terminal states have reward +1 and -1, respectively, and all other states have a reward of -0.04." (Artificial Intelligence: A Modern Approach (Third Edition), P646)

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
__pycache__		__pycache__
Final-Project.pdf		Final-Project.pdf
README.md		README.md
drawHeatMap.py		drawHeatMap.py
qValueIteration_Seo_Aaron.py		qValueIteration_Seo_Aaron.py
rewardTable.py		rewardTable.py
testQValueIteration_Seo_Aaron.py		testQValueIteration_Seo_Aaron.py
transitionTable.py		transitionTable.py
valueIterationHeatMap_R=-0.04.jpg		valueIterationHeatMap_R=-0.04.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Value Iteration Algorithm for Reinforcement Learning

About

Releases

Packages

Languages

aaron-seo/Value-Iteration-Algorithm

Folders and files

Latest commit

History

Repository files navigation

Value Iteration Algorithm for Reinforcement Learning

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages