ＴＤ誤差に基づく強化学習のメタパラメータ学習法

溝上 裕之, 小林 邦和, 呉本 尭, 大林 正直

doi:10.1541/ieejeiss.129.1730

書誌事項

タイトル別名

A Meta-Parameter Learning Method in Reinforcement Learning Based on Temporal Difference Error
TD ゴサニモトズクキョウカガクシュウノメタパラメータガクシュウホウ

この論文をさがす

説明

In general, meta-parameters in a reinforcement learning system such as learning rate are empirically determined and fixed during the learning. Therefore, when an external environment has changed, the sytem cannot adjust to the change. Meanwhile, it is suggested that the biological brain could conduct reinforcement learning and adjust to the external environment by controlling neuromodulators corresponding to meta-parameters. In the present paper, based on the above suggestion, a method to adjust meta-parameters using the TD-error is proposed. Through computer simulations using maze problem and inverted pendulum control problem, it is verified that meta-parameters are appropriately adjusted according to the amplitude of the TD-error.

収録刊行物

電気学会論文誌Ｃ（電子・情報・システム部門誌）

電気学会論文誌Ｃ（電子・情報・システム部門誌） 129 (9), 1730-1736, 2009

一般社団法人電気学会

キーワード

詳細情報詳細情報について

CRID: 1390282679582797824

NII論文ID: 10025102012

NII書誌ID: AN10065950

DOI: 10.1541/ieejeiss.129.1730

ISSN: 13488155; 03854221

NDL書誌ID: 10421449

Web Site: http://id.ndl.go.jp/bib/10421449; https://ndlsearch.ndl.go.jp/books/R000000004-I10421449; http://www.jstage.jst.go.jp/article/ieejeiss/129/9/129_9_1730/_pdf

本文言語コード: ja

資料種別: journal article

データソース種別

JaLC
NDLサーチ
Crossref
CiNii Articles
KAKEN
OpenAIRE

抄録ライセンスフラグ: 使用不可

書き出し

問題の指摘

ＴＤ誤差に基づく強化学習のメタパラメータ学習法

書誌事項

この論文をさがす

説明

収録刊行物

参考文献 (18)*注記

関連プロジェクト

キーワード

詳細情報詳細情報について

書き出し

問題の指摘

ＴＤ誤差に基づく強化学習のメタパラメータ学習法

書誌事項

この論文をさがす

説明

収録刊行物

参考文献 (18)*注記

関連プロジェクト

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

詳細情報詳細情報について