
PyTorch implementation of a Connect Four bot that uses minimax tree search with alpha-beta pruning and a learnable value function for evaluating terminal nodes.


WhimsicalWill/ConnectFourAB


Implementation of minimax tree search with alpha-beta pruning and a learned value function. The implementation works for discrete zero-sum games and is tested on Connect Four.
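As a rough illustration of the search described above, here is a minimal sketch of depth-limited minimax (in negamax form) with alpha-beta pruning, where a value function scores positions at the depth cutoff. It is not the repository's code: the names `alphabeta`, `legal_moves`, `apply_move`, `terminal_value`, and `value_fn` are hypothetical, and a tiny Nim game stands in for Connect Four to keep the example short. The constant `value_fn` is a placeholder for a learned network.

```python
import math

def alphabeta(state, depth, alpha, beta,
              legal_moves, apply_move, terminal_value, value_fn):
    """Return (value, best_move) from the perspective of the player to move.

    All four callables are hypothetical hooks for a generic zero-sum game.
    """
    outcome = terminal_value(state)
    if outcome is not None:          # game over: use the exact result
        return outcome, None
    if depth == 0:                   # depth cutoff: ask the value function
        return value_fn(state), None

    best_value, best_move = -math.inf, None
    for move in legal_moves(state):
        # Negamax: the child's value is from the opponent's view, so negate
        # it and swap/negate the (alpha, beta) window.
        child_value, _ = alphabeta(apply_move(state, move), depth - 1,
                                   -beta, -alpha,
                                   legal_moves, apply_move,
                                   terminal_value, value_fn)
        child_value = -child_value
        if child_value > best_value:
            best_value, best_move = child_value, move
        alpha = max(alpha, best_value)
        if alpha >= beta:            # opponent would never allow this line
            break
    return best_value, best_move

# Toy game (assumption, for illustration only): Nim with one pile.
# A state is the number of stones left; a move removes 1 to 3 stones;
# whoever takes the last stone wins.
def nim_moves(stones):
    return range(1, min(3, stones) + 1)

def nim_apply(stones, take):
    return stones - take

def nim_terminal(stones):
    # No stones left means the previous player took the last one,
    # so the player to move has lost.
    return -1.0 if stones == 0 else None

def placeholder_value(stones):
    return 0.0                       # stand-in for a learned value network

def nim_search(stones, depth):
    return alphabeta(stones, depth, -math.inf, math.inf,
                     nim_moves, nim_apply, nim_terminal, placeholder_value)

if __name__ == "__main__":
    print(nim_search(5, 10))
```

With enough depth the search reaches true terminal states and the value function is never consulted; with a shallow depth the result depends entirely on how good the value estimates are, which is why the value function is trained.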

The value function is trained on an n-step reward objective. Training is on-policy: each update uses only experience from the current episode.
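The exact form of the objective is not spelled out here, so the following is only a sketch of one common reading: an n-step return that sums up to `n` discounted rewards and then bootstraps from the value estimate `n` steps ahead, truncating at the end of the episode. The function name `n_step_targets`, the discount `gamma`, and the convention that all rewards and values are taken from one fixed player's perspective are assumptions, not details from the repository.

```python
def n_step_targets(rewards, values, n, gamma=1.0):
    """Compute n-step return targets for every step of one episode.

    rewards[t] is the reward received after step t, values[t] is the current
    estimate V(s_t). The state after the last step is terminal (value 0).
    """
    num_steps = len(rewards)
    targets = []
    for t in range(num_steps):
        end = min(t + n, num_steps)
        # Discounted sum of the rewards actually observed in the window.
        target = sum(gamma ** (k - t) * rewards[k] for k in range(t, end))
        # Bootstrap from the value estimate only if the episode has not ended.
        if end < num_steps:
            target += gamma ** (end - t) * values[end]
        targets.append(target)
    return targets

if __name__ == "__main__":
    # One episode with a win on the final move (hypothetical numbers).
    episode_rewards = [0.0, 0.0, 0.0, 1.0]
    episode_values = [0.1, 0.2, 0.3, 0.4]
    print(n_step_targets(episode_rewards, episode_values, n=2))
```

In a PyTorch training loop, targets like these would typically be regressed against the network's predictions for the same states (for example with a mean squared error loss), using only the episode that was just played, which matches the on-policy description above.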
