MC Softmax 探索における局面の並列評価： GPU とニューラルネットワークモデルの利用

吉野, 拓真, 五十嵐, 治一, 川島, 馨

選択探索の一種として，モンテカルロソフトマックス探索が提案されている．一般に，大規模ニューラルネットワークモデルによる評価関数を利用する場合，計算に時間がかかることから，αβ探索のような全幅探索よりは，選択探索の方が向いている．特に，モンテカルロソフトマックス探索においては，兄弟ノード局面をまとめて評価する際に，GPU による並列計算を用いれば，評価関数が大規模なニューラルネットワークモデルであっても容易に並列化できる可能性がある．本研究では，dlshogi のソースコードを改変し，モンテカルロソフトマックス探索とニューラルネットワークモデルの評価関数を組み合わせたプログラムを作成した．特に，ニューラルネットワークモデルの入力層に提示する局面の特徴量表現を工夫することにより，GPU で兄弟局面を同時に並列計算する際の処理時間を短縮することを試みた．さらに，ノード選択方策にPolicy Network の出力値を取り入れることにより，探索精度の向上を試みた．

As a kind of selective search, the Monte Carlo softmax search has been proposed. In general, selective search is better than full-width search, such as the alpha-beta search, because of the computational time required to use the evaluation function of large-scale neural network models. In particular, in the Monte Carlo softmax search, parallelization can be easily achieved even in the case of large-scale neural network models by using GPU parallelization in the evaluation of sibling nodes. In this study, we have modified the source code of dlshogi to create a program that combines Monte Carlo softmax search and evaluation functions of neural network models. We devised a feature representation of the input layer of the neural network model to reduce the processing time of the parallel computation for the sibling nodes on the GPU. Furthermore, we att empted to improve the accuracy of the search by incorporating the output of the Policy Network into a node selection policy.

MC Softmax 探索における局面の並列評価： GPU とニューラルネットワークモデルの利用

書誌事項

抄録

収録刊行物

キーワード

詳細情報詳細情報について

書き出し

問題の指摘

MC Softmax 探索における局面の並列評価： GPU とニューラルネットワークモデルの利用

書誌事項

抄録

収録刊行物

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

詳細情報詳細情報について