Optimizing Betting Fraction in Compound Reinforcement Learning

Matsui Tohgoroh, Goto Takashi, Izumi Kiyoshi, Chen Yu

doi:10.1527/tjsai.28.267

Bibliographic Information

Other Title

複利型強化学習における投資比率の最適化

Abstract

This paper describes the optimization of the betting fraction parameter in compound reinforcement learning. Compound reinforcement learning maximizes the expected logarithm of compound returns in return-based MDPs. To keep values from diverging to negative infinity, it introduces a betting fraction parameter, which in turn raises the problem of how to choose that parameter. We propose a method that optimizes the betting fraction by on-line gradient ascent within compound reinforcement learning.
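
The idea of tuning a betting fraction by on-line gradient ascent can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes a single-state setting with no MDP, returns R bounded below by -1, and the objective E[log(1 + R f)], whose per-sample gradient with respect to the betting fraction f is R / (1 + R f). The function names, the step-size schedule 1 / (t + 10), and the clipping bound f_max are all hypothetical choices made for illustration.

```python
import random


def online_betting_fraction(returns, f0=0.5, f_max=0.99):
    """On-line gradient ascent on E[log(1 + R * f)] over a stream of returns.

    Assumes every return satisfies R >= -1, so clipping f to [0, f_max]
    with f_max < 1 keeps 1 + R * f strictly positive (log stays finite).
    """
    f = f0
    for t, r in enumerate(returns):
        step = 1.0 / (t + 10)       # decaying step size (illustrative choice)
        grad = r / (1.0 + r * f)    # d/df of log(1 + r * f)
        f = min(max(f + step * grad, 0.0), f_max)  # project back onto [0, f_max]
    return f


def sample_returns(n, p_win=0.6, seed=0):
    """Even-money bets: return +1 with probability p_win, otherwise -1."""
    rng = random.Random(seed)
    return [1.0 if rng.random() < p_win else -1.0 for _ in range(n)]


if __name__ == "__main__":
    f_hat = online_betting_fraction(sample_returns(100_000))
    print(f"estimated betting fraction: {f_hat:.3f}")
```

For even-money bets with win probability p, the fraction that maximizes the expected log return is 2p - 1, so with p_win = 0.6 the estimate should settle near 0.2. Clipping f below 1 plays the role the abstract attributes to the betting fraction: it prevents a total loss from driving the logarithm to negative infinity.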

Journal

Transactions of the Japanese Society for Artificial Intelligence

Transactions of the Japanese Society for Artificial Intelligence 28 (3), 267-272, 2013

The Japanese Society for Artificial Intelligence

Details

CRID: 1390282680084776576

NII Article ID: 130003362329

DOI: 10.1527/tjsai.28.267

BIBCODE: 2013TJSAI..28..267M

ISSN: 1346-8030; 1346-0714

Web Site: https://www.jstage.jst.go.jp/article/tjsai/28/3/28_267/_pdf

Text Lang: ja

Data Source

JaLC
Crossref
CiNii Articles
KAKEN

Abstract License Flag: Disallowed
