振舞複製による敵対的模倣学習高速化に関する考察

Fumihiro Sasaki (佐々木 史紘), Ryota Yamashina (山科 亮太)

doi:10.1299/jsmermd.2020.2a1-l11

Bibliographic details

Alternative title

A Study On Accelerating Adversarial Imitation Learning By Behavioral Cloning

Description

Imitation learning is a popular way to obtain policies for autonomous robots from expert demonstrations. Recently, adversarial imitation learning methods such as generative adversarial imitation learning (GAIL) have achieved great success even on complex continuous control tasks. However, GAIL and its variants require a huge number of environment interactions, which often makes training the robot take an impractically long time. An intuitive way to reduce the number of interactions is to initialize the policy by behavioral cloning (BC) before running GAIL, as pointed out in [1]. However, Sasaki et al. report that BC initialization does not reduce the number of interactions at all and instead significantly harms the imitation results. In this paper, we further analyze BC initialization to understand why the results run counter to this intuition. Experimental results show that one cause of the failure is that BC initialization makes the gradients of the objective functions of the adversarial imitation learning algorithms vanish, even though those objectives differ from that of BC.

Published in

The Proceedings of JSME annual Conference on Robotics and Mechatronics (Robomec) (ロボティクス・メカトロニクス講演会講演概要集) 2020 (0), 2A1-L11-, 2020

The Japan Society of Mechanical Engineers

Details

CRID: 1391693801405363456

NII Article ID: 130007943992

DOI: 10.1299/jsmermd.2020.2a1-l11

ISSN: 24243124

Web Site: https://www.jstage.jst.go.jp/article/jsmermd/2020/0/2020_2A1-L11/_pdf

Text language code: ja

Data source type

JaLC
Crossref
CiNii Articles

Abstract license flag: Disallowed