Ways to cheat in training

Here are the top ways to cheat during training if you don't have 5000 TPUs:

Train naked network on another chess engine's output, or just write your own evaluation function.
Adjudicate games during self-play based on heuristics like material
Train only on positions where the network makes blunders (check with engine). Thus reducing noise during training.
Instead of self-play, use engine vs. network games.
Set loss weight of policy to 0.1 and replace values by your own heuristic.
Train on high elo vs low elo and zero out the policy of the worse player. SCID is great for filtering millions of games.

Provide feedback