モンテカルロ法における勝率近似関数の組み込み方法

但馬, 康宏, Tajima, Yasuhiro

書誌事項

タイトル別名

A combination method between Monte-Carlo simulations and a win-rate approximation function

説明

本稿では，UCB1 アルゴリズムを用いたモンテカルロシミュレーションにおいて，評価関数を効果的に取り込む方法を提案する．過去の研究においてもUCT アルゴリズムに対してヒューリスティックな評価関数を用いて着手の制限を行い，効率を高める提案がなされているが，本手法はUCB1 アルゴリズムの一部に評価関数をスムーズに取り込む方法である．評価実験として，ブロックスデュオにおいて，UCB1 アルゴリズムおよび評価関数のみによるアルゴリズムと対戦し，計算時間と勝敗を計測した．その結果，一定の成果があることが確認できた．

We show a combination method between UCB1 algorithm and an win-rate approximation function. Even though there are some studies which uses a heuristic evaluation function in UCT algorithm, our method takes the evaluation function into UCB1 algorithm smoothly. For evaluation to confirm our method, we made some matches between our algorithm and UCB1 algorithm or a heuristic search algorithm.

収録刊行物

ゲームプログラミングワークショップ2008論文集

ゲームプログラミングワークショップ2008論文集 2008 (11), 100-103, 2008-10-31

情報処理学会

詳細情報詳細情報について

CRID: 1050855522107778304

NII論文ID: 170000080342

Web Site: https://ipsj.ixsq.nii.ac.jp/records/97697

本文言語コード: ja

資料種別: conference paper

データソース種別

IRDB
CiNii Articles

書き出し

問題の指摘