Add a Open_Spiel Stratego implementation as well adapting MCTSAgent by defining different types of it #146

BluemlJ · 2021-07-10T13:46:15Z

With this pull request, I add an interface to CrazyAra which enables the use of my own Stratego implementation which is based on Open_Spiel. This PR adds two major extensions to CrazyAra:

First the extension of a Stratego environment:

The interface is based on a new state (engine/src/environments/stratego_related/), based on the existing interface for open_spiel games. It also adds my own fork of open_spiel as a submodule because it is not merged with the official open_spiel project yet. (.gitmodules)

To use CrazyAra for Stratego a new game mode called MODE_STRATEGO is added to all necessary files as well as the CMakeList:

engine/CMakeLists.txt
engine/src/agents/agent.cpp (some improved console output for better understanding Stratego moves)
engine/src/nn/tensorrtapi.cpp
engine/src/rl/selfplay.cpp
engine/src/stateobj.h
engine/src/uci/optionsuci.cpp
engine/src/uci/variants.h

It is compatible with the Profiling modes as well as the RL mode and is tested with the TensorRT backend.
A building test was also added to the GitHub workflow as well as the possiblity to start the variants workflow out of Github to make testing easier (variants.yml).

The second extension was the addition of MCTSAgent Types. In my thesis, I tested different agent types on imperfect information games. For this, I added new agents which inherit the basic MCTSAgent

engine/src/agents/mctsagentbatch, runs multiple mcts agents and combine the results at the end
engine/src/agents/randomagent, plays random moves wihtout mcts
engine/src/agents/mctsagenttruesight, removes hidden information from imperfect information games to play on a perfect infomation game

I also adapted MCTSAgent for this and added some specific handling for Stratego (like playing starting positions twice with alternating colors). I also added three new modes which are currently only tested on Stratego to run comparisons between different MCTSAgent types and models more easily, directly on the console without using additional software.

mctsarena, gets 2 different agent types, the number of matches to play and uses the same model on both agents
mctstournament, gets a list of agent types to play a round-robin tournament of n games between all given agents. All agents use the same model.
tournament, gets tuples of agen types and models to play a round-robin tournament of n games between all given tuples. The models must be defined in separate folders next to the binary.

I also added a Dockerfile for Stratego which extends the current docker file by installing open_spiel and the dependencies.
(engine/src/rl/Dockerfile_Stratego)

…e stratego needs another board

…e stratego needs another board - this time correct

…e stratego needs another board

…e stratego needs another board - this time correct

engine/src/environments/stratego_related/strategostate.cpp

engine/src/uci/crazyara.cpp

Remove static numberofgames from arena modes and add it to the inputstring

Remove unnecessary code

Remove blank lines

engine/src/uci/crazyara.h

Add some comments to the alternative arena methods : mctsarena, mctstournament and evaltournament

engine/src/agents/agent.cpp

remove blank line

engine/src/environments/stratego_related/strategostate.cpp

Remove the code to print InformationStateTensors as a debugging feature.

engine/src/rl/selfplay.h

remove blank line

changed constats to return uint instead of int

reduced scope of bestMoveIdx

removed unused shape removed unnecessary assignment

removed unused variable fen2

avoid reassigning a value before the old one has been used

pass string by const reference

Jannis Blüml added 30 commits July 9, 2021 18:22

StrategoAra Integration

bdd9081

test Workflows

bd15ce6

set mode to crazyara

c3bcf18

add workflow dispatch trigger

6d9e605

add workflow trigger v2

8ad0016

add workflow trigger v3

8afa28f

add workflow trigger v4

178a8d3

add workflow trigger v5 - change path to openspiel

b6c9b03

add workflow trigger v6 - change path to openspiel

9c6ae51

add workflow trigger v7 - remove OpenSpiel install from action

49907e1

repair variants.h

5cc0d6f

add workflow trigger v8 - add OpenSpiel install to variants.yml again

9078d84

add workflow trigger v9 - add OpenSpiel install to variants.yml again

2e001f6

add submodule open_spiel_yorktown

82db7a9

remove open_spiel

403a617

add open_spiel.thesis

d287368

change variants.yml v11 - install open_spiel

3af723a

change variants.yml v12 - add permission

0ca5dc7

change variants.yml v13 - add permission via sudo

1048ad5

disable some tests based on the boardstate which arent working becaus…

edfec2f

…e stratego needs another board

disable some tests based on the boardstate which arent working becaus…

945d99a

…e stratego needs another board - this time correct

disable some tests based on the boardstate which arent working becaus…

c652eb8

…e stratego needs another board

disable some tests based on the boardstate which arent working becaus…

79f4378

…e stratego needs another board - this time correct

attempt 2

93184c5

repair #ifndef

ebdce15

repair #ifndef again

e5fa918

remove tests from stratego build

1cb59da

scope problems

8973fed

scope problems v2

881f0e7

scope problems v3

24d272b