forked from Zeta36/chess-alpha-zero
-
Notifications
You must be signed in to change notification settings - Fork 14
Ways to cheat in training
Michael Pang edited this page Dec 20, 2017
·
7 revisions
Here are the top ways to cheat during training if you don't have 5000 TPUs:
- Train naked network on another chess engine's output, or just write your own evaluation function.
- Adjudicate games during self-play based on heuristics like material
- Train only on positions where the network makes blunders (check with engine). Thus reducing noise during training.
- Instead of self-play, use engine vs. network games.
- Set loss weight of policy to 0.1 and replace values by your own heuristic.
- Train on high elo vs low elo and zero out the policy of the worse player. SCID is great for filtering millions of games.