Optimizing Betting Fraction in Compound Reinforcement Learning
-
- Matsui Tohgoroh
- Chubu University
-
- Goto Takashi
- Bank of Tokyo-Mitsubishi UFJ, Ltd.
-
- Izumi Kiyoshi
- JST PRESTO The University of Tokyo
-
- Chen Yu
- The University of Tokyo
Bibliographic Information
- Other Title
-
- 複利型強化学習における投資比率の最適化
Abstract
This paper describes optimization of the betting fraction parameter in compound reinforcement learning. Compound reinforcement learning maximizes the expected logarithm of compound returns in return-based MDPs. However, a new betting fraction parameter is introduced in order not to diverge values to negative infinity and it causes a problem of choosing the parameter. In this paper, we proposed a method to optimize the betting fraction with on-line gradient ascent in compound reinforcement learning.
Journal
-
- Transactions of the Japanese Society for Artificial Intelligence
-
Transactions of the Japanese Society for Artificial Intelligence 28 (3), 267-272, 2013
The Japanese Society for Artificial Intelligence
- Tweet
Details 詳細情報について
-
- CRID
- 1390282680084776576
-
- NII Article ID
- 130003362329
-
- BIBCODE
- 2013TJSAI..28..267M
-
- ISSN
- 13468030
- 13460714
-
- Text Lang
- ja
-
- Data Source
-
- JaLC
- Crossref
- CiNii Articles
- KAKEN
-
- Abstract License Flag
- Disallowed