Weighted Average Composition of Deep Reinforcement Learning Agents in Discrete Action Problems

SATO Kenichiro, KOHJIMA Masahiro, MATSUBAYASHI Tatsushi, TODA Hiroyuki

doi:10.14923/transinfj.2019det0002

Bibliographic Information

Other Title

深層強化学習Agentの離散行動空間タスクにおける重み付き結合

Description

Composition of pre-trained agents is gathering attention in the field of reinforcement learning since this approach allows us to construct an agent that solves a new task by combining multiple pre-trained agents that solve different tasks. In this study, we extend an existing method that composes pre-trained agents with simple average and propose a new method that composes pre-trained agents with a weighted average. The proposed method enables us to solve a new task whose reward function is expressed as the linear combination of base tasks. We verify the effectiveness of the proposed method by CartPole control and traffic signal control problems.

Journal

電子情報通信学会論文誌D 情報・システム

電子情報通信学会論文誌D 情報・システム J103-D (5), 403-414, 2020-05-01

The Institute of Electronics, Information and Communication Engineers

Keywords

Details 詳細情報について

CRID: 1390285300154149632

DOI: 10.14923/transinfj.2019det0002

ISSN: 18810225; 18804535

Text Lang: ja

Data Source

JaLC

Abstract License Flag: Disallowed

Export

Report a problem