Reinforcement learning using on-line EM algorithm

ISHII Shin, SATO Masa-aki

Bibliographic Information

Other Title

オンラインEMアルゴリズムを用いた強化学習法

Description

In this research report, we propose a new reinforcement learning (RL) method based on an actor-critic architecture. The actor and the critic are approximated by normalized Gausssian networks, which are trained by the on-line EM algorithm proposed in our previous paper. We apply our RL method to the task of swing-up and stabilizing a single pendulum and the task of balacing a double pendulum near the upright position. The experimental results show that our RL method can be applied to optimal control problems having continuous state/action spaces.

Journal

IEICE technical report. Neurocomputing

IEICE technical report. Neurocomputing 98 (577), 41-48, 1999-02-05

The Institute of Electronics, Information and Communication Engineers

Keywords

Details 詳細情報について

CRID: 1570291227540510080

NII Article ID: 110003233550

NII Book ID: AN10091178

Text Lang: ja

Data Source

CiNii Articles

Export

Report a problem