Adversarial Inverse Reinforcement Learning to Estimate Policies from Multiple Experts
-
- Yamashita Kodai
- Graduate School of Engineering Science, Yokohama National University
-
- Hamagami Tomoki
- Facluty of Engineering, Yokohama National University
Bibliographic Information
- Other Title
-
- 複数のエキスパートから方策推定を行う敵対的逆強化学習
- フクスウ ノ エキスパート カラ ホウサク スイテイ オ オコナウ テキタイテキ ギャクキョウカ ガクシュウ
Search this article
Description
<p>Inverse reinforcement learning is used for complex control tasks by using experts. However, since the learning results depend on the expert, it is impossible to imitate ungiven policies from expert when there are multiple optimal polices for the same goal, or when the environment changes from the training. The problems can be solved by giving multiple experts and representing their features in the latent space. the proposed method extends information maximizing generative adversarial imitation learning with adversarial inverse reinforcement learning to deal with such environment. Experiments show that the proposed method can not only imitate multiple experts, but also estimate ungiven polices.</p>
Journal
-
- IEEJ Transactions on Electronics, Information and Systems
-
IEEJ Transactions on Electronics, Information and Systems 141 (12), 1405-1410, 2021-12-01
The Institute of Electrical Engineers of Japan
- Tweet
Keywords
Details 詳細情報について
-
- CRID
- 1390008764029176576
-
- NII Article ID
- 130008123513
-
- NII Book ID
- AN10065950
-
- ISSN
- 13488155
- 03854221
-
- NDL BIB ID
- 031857151
-
- Text Lang
- ja
-
- Article Type
- journal article
-
- Data Source
-
- JaLC
- NDL Search
- Crossref
- CiNii Articles
- KAKEN
- OpenAIRE
-
- Abstract License Flag
- Disallowed