UCT探索における局面評価関数の使用方法と性能評価

松本, 渉, Wataru, Matsumoto

新たなゲーム木探索法としてモンテカルロ木探索，特にUCTが成功を収め広く研究されている．しかし，モンテカルロ木探索の主なターゲットが囲碁であるため，局面評価関数を使用したUCTの研究は少ない．本研究ではUCB値に局面評価関数を使用する際の新たな手法，モンテカルロシミュレーションに局面評価関数を使用する手法，プレイアウトを早く打ち切り，その際に局面評価関数を使用する手法の3つに着目し，その有効性を示す．実験ではすでに優れた局面評価関数が存在するオセロにおいて行い，局面評価関数には世界最強クラスのプログラムであるZebraのものを使用した．その結果，単純にUCB値に局面評価関数を用いたUCTよりも優れた性能を発揮できることが実証された．

The Monte Carlo tree search, particularly UCT, gains a great success and is being widely studied as a new game tree search method.However UCT using position evaluation function has hardly been studied because the main target of UCT search is the game of GO.We focus on a new method when the UCB value uses position evaluation function,a method that the Monte Carlo simulation uses position evaluation function,and a method that use position evaluation function when stopping the Monte Carlo simulation,and show effectiveness of these method.Experiments are performed by using the game of Othello, that already has strong position evaluation function.Evaluation function of the Zebra, the strong othello program, was used for the experiments.The results showed that these method outperformed UCT using simply position evaluation function in the UCB value.

UCT探索における局面評価関数の使用方法と性能評価

書誌事項

説明

収録刊行物

詳細情報詳細情報について

書き出し

問題の指摘

UCT探索における局面評価関数の使用方法と性能評価

書誌事項

説明

収録刊行物

詳細情報 詳細情報について

書き出し

問題の指摘

詳細情報詳細情報について