Fast reinforcement learning using asymmetric probability density function

Masanao Obayashi, Kunikazu Kobayashi, Kousuke Umesako

doi:10.1109/sice.2002.1195260

説明

We propose an asymmetric probability density function (PDF) to select an effective action on reinforcement learning (RL). The proposed method utilizing the information of search direction enables RL to reduce the number of trials. Furthermore, the proposed method can be applied easily to various methods of RL, for example, actor-critic, stochastic gradient ascent method. The performance of our proposed method is demonstrated by computer simulations.

収録刊行物

Proceedings of the 41st SICE Annual Conference. SICE 2002.

Proceedings of the 41st SICE Annual Conference. SICE 2002. 2 804-809, 2003-06-26

Soc. Instrument & Control Eng. (SICE)

詳細情報詳細情報について

CRID: 1870302167923729664

DOI: 10.1109/sice.2002.1195260

データソース種別

OpenAIRE

書き出し

問題の指摘

Fast reinforcement learning using asymmetric probability density function

説明

収録刊行物

詳細情報 詳細情報について

書き出し

問題の指摘

詳細情報詳細情報について