Adversarial Inverse Reinforcement Learning to Estimate Policies from Multiple Experts

Yamashita Kodai, Hamagami Tomoki

doi:10.1541/ieejeiss.141.1405

Adversarial Inverse Reinforcement Learning to Estimate Policies from Multiple Experts

DOI Web Site Web Site 1 References

Yamashita Kodai

Graduate School of Engineering Science, Yokohama National University
Hamagami Tomoki

Facluty of Engineering, Yokohama National University

Bibliographic Information

Other Title

複数のエキスパートから方策推定を行う敵対的逆強化学習
フクスウノエキスパートカラホウサクスイテイオオコナウテキタイテキギャクキョウカガクシュウ

Search this article

Description

<p>Inverse reinforcement learning is used for complex control tasks by using experts. However, since the learning results depend on the expert, it is impossible to imitate ungiven policies from expert when there are multiple optimal polices for the same goal, or when the environment changes from the training. The problems can be solved by giving multiple experts and representing their features in the latent space. the proposed method extends information maximizing generative adversarial imitation learning with adversarial inverse reinforcement learning to deal with such environment. Experiments show that the proposed method can not only imitate multiple experts, but also estimate ungiven polices.</p>

Journal

IEEJ Transactions on Electronics, Information and Systems

IEEJ Transactions on Electronics, Information and Systems 141 (12), 1405-1410, 2021-12-01

The Institute of Electrical Engineers of Japan

References(1)*help

Related Projects

Keywords

Details 詳細情報について

CRID

1390008764029176576
NII Article ID

130008123513
NII Book ID

AN10065950
DOI

10.1541/ieejeiss.141.1405
ISSN

13488155

03854221
NDL BIB ID

031857151
Web Site

http://id.ndl.go.jp/bib/031857151

https://ndlsearch.ndl.go.jp/books/R000000004-I031857151

https://www.jstage.jst.go.jp/article/ieejeiss/141/12/141_1405/_pdf
Text Lang

ja
Article Type

journal article
Data Source
- JaLC
- NDL Search
- Crossref
- CiNii Articles
- KAKEN
- OpenAIRE
Abstract License Flag
Disallowed

Adversarial Inverse Reinforcement Learning to Estimate Policies from Multiple Experts

Bibliographic Information

Search this article

Description

Journal

References(1)*help

Related Projects

Keywords

Details 詳細情報について

Export

Report a problem