状態非依存の方策を用いた新しい強化学習手法の提案

中野 太智, 石井 信, 前田 新一

doi:10.5687/iscie.27.327

書誌事項

タイトル別名

<b>Proposal of New Reinforcement Learning with a State-independent Policy</b>
ジョウタイヒイソンノホウサクオモチイタアタラシイキョウカガクシュウシュホウノテイアン
Proposal of New Reinforcement Learning with a State-independent Policy

公開日: 2014

資源種別: journal article

DOI

10.5687/iscie.27.327

公開者: 一般社団法人システム制御情報学会

この論文をさがす

説明

Usually, reinforcement learning (RL) algorithms have a difficulty to learn the optimal control policy as the dimensionality of the state (and action) becomes large, because of the explosive increase in the search space to optimize. To avoid such an unfavorable explosive increase, in this study, we propose BASLEM algorithm (Blind Action Sequence Learning with EM algorithm) which acquires a state-independent and time-dependent control policy starting from a certain fixed initial state. Numerical simulation to control a non-holonomic system shows that RL of state-independent and time-dependent policies attain great improvement in efficiency over the existing RL algorithm.

収録刊行物

システム制御情報学会論文誌

システム制御情報学会論文誌 27 (8), 327-332, 2014

一般社団法人システム制御情報学会

キーワード

詳細情報詳細情報について

CRID: 1390282680143427840

NII論文ID: 130004707732

NII書誌ID: AN1013280X

DOI: 10.5687/iscie.27.327

ISSN: 2185811X; 13425668

NDL書誌ID: 025637975

Web Site: http://id.ndl.go.jp/bib/025637975; https://ndlsearch.ndl.go.jp/books/R000000004-I025637975

本文言語コード: ja

資料種別: journal article

データソース種別

JaLC
NDLサーチ
Crossref
CiNii Articles
KAKEN

抄録ライセンスフラグ: 使用不可

書き出し

問題の指摘

状態非依存の方策を用いた新しい強化学習手法の提案

書誌事項

この論文をさがす

説明

収録刊行物

参考文献 (1)*注記

関連プロジェクト

キーワード

詳細情報詳細情報について

書き出し

問題の指摘

状態非依存の方策を用いた新しい強化学習手法の提案

書誌事項

この論文をさがす

説明

収録刊行物

参考文献 (1)*注記

関連プロジェクト

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

詳細情報詳細情報について