Adaptation Method of the Exploration Ratio Based on the Orientation of Equilibrium in Multi-Agent Reinforcement Learning Under Non-Stationary Environments
-
- Okano Takuya
- Fujitsu Limited
-
- Noda Itsuki
- National Institute of Advanced Industrial Science and Technology (AIST)
この論文をさがす
説明
<p>In this paper, we propose a method to adapt the exploration ratio in multi-agent reinforcement learning. The adaptation of exploration ratio is important in multi-agent learning, as this is one of key parameters that affect the learning performance. In our observation, the adaptation method can adjust the exploration ratio suitably (but not optimally) according to the characteristics of environments. We investigated the evolutionarily adaptation of the exploration ratio in multi-agent learning. We conducted several experiments to adapt the exploration ratio in a simple evolutionary way, namely, mimicking advantageous exploration ratio (MAER), and confirmed that MAER always acquires relatively lower exploration ratio than the optimal value for the change ratio of the environments. In this paper, we propose a second evolutionary adaptation method, namely, win or update exploration ratio (WoUE). The results of the experiments showed that WoUE can acquire a more suitable exploration ratio than MAER, and the obtained ratio was near-optimal.</p>
収録刊行物
-
- Journal of Advanced Computational Intelligence and Intelligent Informatics
-
Journal of Advanced Computational Intelligence and Intelligent Informatics 21 (5), 939-947, 2017-09-20
富士技術出版株式会社
- Tweet
詳細情報 詳細情報について
-
- CRID
- 1390001288093343872
-
- NII論文ID
- 130007520190
-
- NII書誌ID
- AA12042502
-
- ISSN
- 18838014
- 13430130
-
- NDL書誌ID
- 028510939
-
- 本文言語コード
- en
-
- 資料種別
- journal article
-
- データソース種別
-
- JaLC
- NDLサーチ
- Crossref
- CiNii Articles
- KAKEN
-
- 抄録ライセンスフラグ
- 使用不可