GitHub - clay-curry/flapPy-RL: an RL algorithm solving Flappy Bird. each episode decides a final score R upon crashing, so we can choose q : S × A → ℝ naturally to be the expected value E(R) from the state-action pair (s, a). the experiment confirms that a tabular, n-step Sarsa algorithm estimating q approximates q* with sufficient precision to decide a π* with arbitrary large R

clay-curry / flapPy-RL Public

Notifications You must be signed in to change notification settings
Fork 0
Star 0

an RL algorithm solving Flappy Bird. each episode decides a final score R upon crashing, so we can choose q : S × A → ℝ naturally to be the expected value E(R) from the state-action pair (s, a). the experiment confirms that a tabular, n-step Sarsa algorithm estimating q approximates q* with sufficient precision to decide a π* with arbitrary large R

0 stars 0 forks Branches Tags Activity

Star

Notifications

Branches Tags

Name		Name	Last commit message	Last commit date
Latest commit History 100 Commits
assets		assets
data		data
.gitignore		.gitignore
config.py		config.py
flappy.py		flappy.py
n_sarsa.py		n_sarsa.py
q_agent.py		q_agent.py
q_agent_flappy.py		q_agent_flappy.py