Fast reinforcement learning using asymmetric probability density function

Masanao Obayashi, Kunikazu Kobayashi, Kousuke Umesako

doi:10.1109/sice.2002.1195260

【Updated on May 12, 2025】 Integration of CiNii Dissertations and CiNii Books into CiNii Research
Trial version of CiNii Research Knowledge Graph Search feature is available on CiNii Labs
【Updated on June 30, 2025】Suspension and deletion of data provided by Nikkei BP
Regarding the recording of “Research Data” and “Evidence Data”

Fast reinforcement learning using asymmetric probability density function

Description

We propose an asymmetric probability density function (PDF) to select an effective action on reinforcement learning (RL). The proposed method utilizing the information of search direction enables RL to reduce the number of trials. Furthermore, the proposed method can be applied easily to various methods of RL, for example, actor-critic, stochastic gradient ascent method. The performance of our proposed method is demonstrated by computer simulations.

Journal

Proceedings of the 41st SICE Annual Conference. SICE 2002.

Proceedings of the 41st SICE Annual Conference. SICE 2002. 2 804-809, 2003-06-26

Soc. Instrument & Control Eng. (SICE)

Details 詳細情報について

CRID

1870302167923729664
DOI

10.1109/sice.2002.1195260
Data Source
- OpenAIRE

Fast reinforcement learning using asymmetric probability density function

Description

Journal

Details 詳細情報について

Export

Report a problem