複利型強化学習における投資比率の最適化

松井 藤五郎, 後藤 卓, 和泉 潔, 陳 ユ

doi:10.1527/tjsai.28.267

書誌事項

タイトル別名

Optimizing Betting Fraction in Compound Reinforcement Learning

抄録

This paper describes optimization of the betting fraction parameter in compound reinforcement learning. Compound reinforcement learning maximizes the expected logarithm of compound returns in return-based MDPs. However, a new betting fraction parameter is introduced in order not to diverge values to negative infinity and it causes a problem of choosing the parameter. In this paper, we proposed a method to optimize the betting fraction with on-line gradient ascent in compound reinforcement learning.

収録刊行物

人工知能学会論文誌

人工知能学会論文誌 28 (3), 267-272, 2013

一般社団法人人工知能学会

キーワード

詳細情報詳細情報について

CRID: 1390282680084776576

NII論文ID: 130003362329

DOI: 10.1527/tjsai.28.267

BIBCODE: 2013TJSAI..28..267M

ISSN: 13468030; 13460714

Web Site: https://www.jstage.jst.go.jp/article/tjsai/28/3/28_267/_pdf

本文言語コード: ja

データソース種別

JaLC
Crossref
CiNii Articles
KAKEN

抄録ライセンスフラグ: 使用不可

複利型強化学習における投資比率の最適化

書誌事項

抄録

収録刊行物

参考文献 (3)*注記

関連プロジェクト

キーワード

詳細情報詳細情報について

書き出し

問題の指摘

複利型強化学習における投資比率の最適化

書誌事項

抄録

収録刊行物

参考文献 (3)*注記

関連プロジェクト

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

詳細情報詳細情報について