Rise v3.3 #104

QueensGambit · 2021-05-13T14:04:11Z

This PR introduces the RISEv3.3 architecture as an improvement over the RISEv2 architecture.

The development process was influenced by the following papers.
However, most of the proposals turned out to be not beneficial for chess neural networks or suboptimal when applied for GPU inference.

MixConv: Mixed Depthwise Convolutional Kernels, Mingxing Tan, Quoc V. Le, https://arxiv.org/abs/1907.09595
Direct Neural Architecture Search on Target Task and Hardware, Han Cai, Ligeng Zhu, Song Han.
https://arxiv.org/abs/1812.
MnasNet: Platform-Aware Neural Architecture Search for Mobile,
Mingxing Tan, Bo Chen, Ruoming Pang, Vijay Vasudevan, Mark Sandler, Andrew Howard, Quoc V. Le
http://openaccess.thecvf.com/content_CVPR_2019/html/Tan_MnasNet_Platform-Aware_Neural_Architecture_Search_for_Mobile_CVPR_2019_paper.html
FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search,
Bichen Wu, Xiaoliang Dai, Peizhao Zhang, Yanghan Wang, Fei Sun, Yiming Wu, Yuandong Tian, Peter Vajda, Yangqing Jia, Kurt Keutzer,
http://openaccess.thecvf.com/content_CVPR_2019/html/Wu_FBNet_Hardware-Aware_Efficient_ConvNet_Design_via_Differentiable_Neural_Architecture_Search_CVPR_2019_paper.html
MobileNetV3: Searching for MobileNetV3,
Andrew Howard, Mark Sandler, Grace Chu, Liang-Chieh Chen, Bo Chen, Mingxing Tan, Weijun Wang, Yukun Zhu, Ruoming Pang, Vijay Vasudevan, Quoc V. Le, Hartwig Adam.
https://arxiv.org/abs/1905.02244
Convolutional Block Attention Module (CBAM),
Sanghyun Woo, Jongchan Park, Joon-Young Lee, In So Kweon
https://arxiv.org/pdf/1807.06521.pdf
ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks (ecaSE) - Wang et al.
https://arxiv.org/abs/1910.03151

The changes which where incorporated in RISEv3.3 where the following:

Replacing squeeze excitation modules by efficient squeeze excitation modules as proposed by Wang et al. (https://arxiv.org/abs/1910.03151)
Replacing sigmoid by hard-sigmoid as recommend in Searching for MobileNetV3 by Howard et al. (https://arxiv.org/abs/1905.02244)
Making use of 5x5 convolutions in deeper layers as recommended in Platform-Aware Neural Architecture Search for Mobile by Tan et al. (http://openaccess.thecvf.com/content_CVPR_2019/html/Tan_MnasNet_Platform-Aware_Neural_Architecture_Search_for_Mobile_CVPR_2019_paper.html)
Using the flag boolean flag global for average pooling layers
Using a higher initial channel size, more residual blocks but a lower increase of number of channels per layer (32 instead of 64).

The architecture resulted in an ~150 Elo improvement when trained on the same data set, here Kingbase2019lite.
The other only difference other difference was changing the value loss ratio from 0.01 to 0.1.

Score of ClassicAra 0.9.1 - Risev3.3 vs ClassicAra 0.9.1 - Risev2: 81 - 15 - 64 [0.706]
Elo difference: 152.4 +/- 42.8, LOS: 100.0 %, DrawRatio: 40.0 %

160 of 1000 games finished.

changed "python-chess" to "chess"

specified version number

set back to old python-chess version

added efficient_channel_attention_moduel() added ic_layer added hard_sigmoid

* added get_se_layer

* fixed "eca_se" look-up * update kernel size * update train_cnn.ipynb

* added bn layer for value head * fixed train_cnn.ipynb loading

QueensGambit added 16 commits January 18, 2021 15:10

updated requirements.txt

30ba3ac

changed "python-chess" to "chess"

updated requirements.txt

b0007e4

specified version number

updated requirements.txt

0c62671

set back to old python-chess version

added bottleneck_residual_block_v2()

804eed5

added efficient_channel_attention_moduel() added ic_layer added hard_sigmoid

Merge branch 'master' into rise_v3

e740905

* added sandglass_block

cb64016

* added get_se_layer

* added efficient_scaling.py

73bae32

* updated efficient_scaling.py

f916f00

* update efficient_scaling.py

454d6c1

* changed learning rate

7a9bab8

* added global_pool=True

c940f0e

* fixed "eca_se" look-up * update kernel size * update train_cnn.ipynb

* added preact_resnet_se.py

15a0b4a

* enabled raw_features for pre_act_resnet_se

0c0f7de

* added bn layer for value head * fixed train_cnn.ipynb loading

* implemented Risev3.3

05b41a2

Merge branch 'master' into rise_v3

6fed7cb

* clean-up and comments

c9b2b90

QueensGambit merged commit ff89157 into master May 13, 2021

QueensGambit added a commit that referenced this pull request May 13, 2021

* addressed automatic code review (#104)

1146167

QueensGambit mentioned this pull request May 13, 2021

Rise v3.3 - Code Review #105

Closed

QueensGambit added a commit that referenced this pull request May 13, 2021

addresses automatic code review for #104

c3132c1

QueensGambit mentioned this pull request Jun 9, 2021

Chess input representation v3.0 #134

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rise v3.3 #104

Rise v3.3 #104

QueensGambit commented May 13, 2021 •

edited

Loading

Rise v3.3 #104

Rise v3.3 #104

Conversation

QueensGambit commented May 13, 2021 • edited Loading

QueensGambit commented May 13, 2021 •

edited

Loading