Bibliographic Information
- Alternative title: Model-Based Reinforcement Learning using Model Mediator in Dynamic Multi-Agent Environment
Description
<p>Centralised training and decentralised execution (CTDE) is one of the most effective approaches in multi-agent reinforcement learning (MARL). However, CTDE methods still require large amounts of interaction with the environment, even to reach the same performance as very simple heuristic-based algorithms. Although model-based RL is a prominent approach to improving sample efficiency, its adaptation to a multi-agent setting in combination with existing CTDE methods has not been well studied in the literature. The few existing studies only consider settings with relaxed restrictions on the number of agents and the observable range. In this paper, we consider CTDE settings where some information about each agent’s observations (e.g. each agent’s visibility, the number of agents) changes dynamically. In such a setting, the fundamental challenge is how to train models that accurately generate each agent’s observations, with their complex transitions, in addition to the central state, and how to use them for sample-efficient policy learning. We propose a multi-agent model-based RL algorithm based on a novel model architecture consisting of global and local prediction models with a model mediator. We evaluate our model-based RL approach applied to an existing CTDE method on challenging StarCraft II micromanagement tasks and show that it can learn an effective policy with fewer interactions with the environment.</p>
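The abstract's core idea — a global prediction model over the central state, per-agent local prediction models over observations, and a mediator that keeps the two consistent while the set of agents changes — can be illustrated with a minimal sketch. This is not the paper's implementation: the class names, placeholder dynamics, and dict-of-agents interface are all assumptions made purely for illustration.

```python
# Illustrative sketch (hypothetical, not the paper's code): a mediator
# coordinating a global model (central state) with local models (per-agent
# observations) under a dynamically changing agent set.

class GlobalModel:
    """Predicts the next central state from state and joint action."""
    def predict(self, state, joint_action):
        # Placeholder dynamics: record the joint action in the state history.
        return state + [joint_action]

class LocalModel:
    """Predicts one agent's next observation from its observation and action."""
    def predict(self, obs, action):
        # Placeholder dynamics: shift each observation feature by the action.
        return [o + action for o in obs]

class Mediator:
    """Routes information between global and local models so per-agent
    predictions stay consistent with the predicted central state even as
    agents appear or disappear between steps."""
    def __init__(self, global_model, local_model):
        self.global_model = global_model
        self.local_model = local_model

    def step(self, state, observations, actions):
        # `observations` and `actions` are dicts keyed by agent id; the
        # key sets may differ step to step (dynamic number of agents).
        next_state = self.global_model.predict(
            state, tuple(sorted(actions.items()))
        )
        next_obs = {
            agent_id: self.local_model.predict(obs, actions[agent_id])
            for agent_id, obs in observations.items()
            if agent_id in actions  # only agents still present transition
        }
        return next_state, next_obs

mediator = Mediator(GlobalModel(), LocalModel())
state, obs = mediator.step(
    state=[],
    observations={"a1": [0, 1], "a2": [2]},
    actions={"a1": 1},  # "a2" has left the environment this step
)
```

In the paper's setting the models are learned generative networks and the rollouts feed a CTDE policy learner; here the point is only the data flow: the mediator is the single place that reconciles the global state prediction with whichever local models are currently active.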
Journal
- 人工知能学会論文誌 (Transactions of the Japanese Society for Artificial Intelligence), 38 (5), A-MB1_1-14, 2023-09-01
- 一般社団法人 人工知能学会 (The Japanese Society for Artificial Intelligence)
Keywords
Details
- CRID: 1390297305329983360
- ISSN: 13468030, 13460714
- Text language code: ja
- Data source types: JaLC, Crossref, OpenAIRE
- Abstract license flag: unavailable