Learning Policies in Monte-Carlo Shogi

関 栄二, 三輪 誠, 近山 隆

In recent years, particularly since the advent of UCT, strong computer Go players based on Monte-Carlo methods have been built. Following this success, the application of Monte-Carlo methods to Shogi is also being explored. In this paper, we propose applying Simulation Balancing to policy learning in Monte-Carlo Shogi. We trained on about 1,800 positions and carried out a preliminary evaluation; however, because the number of features used was large, the player became weaker than it was before learning.
