Hybrid Learning Using Profit Sharing and Genetic Algorithm under the POMDPs (不完全知覚問題に対するProfit Sharingと遺伝的アルゴリズムを用いたハイブリッド学習)

鈴木 晃平, 加藤 昇平

doi:10.1541/ieejeiss.137.1591


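鈴木晃平
Nagoya Institute of Technology, Graduate School of Engineering, Department of Computer Science

加藤昇平
Nagoya Institute of Technology, Graduate School of Engineering, Department of Computer Science; Nagoya Institute of Technology, Frontier Research Institute for Information Science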

Bibliographic information

Alternative titles

Hybrid Learning Using Profit Sharing and Genetic Algorithm under the POMDPs
フカンゼンチカクモンダイニタイスル Profit Sharing トイデンテキアルゴリズムオモチイタハイブリッドガクシュウ

Description

Reinforcement learning is generally formulated on Markov decision processes (MDPs). However, an agent may be unable to observe the environment correctly because of the limited perception ability of its sensors. Such a setting is called a partially observable Markov decision process (POMDP). In a POMDP environment, an agent may receive the same observation in more than one state. HQ-learning and Episode-based Profit Sharing (EPS) are well-known methods for this problem. HQ-learning divides a POMDP environment into subtasks. EPS distributes the same reward to the state-action pairs in an episode when the agent achieves a goal. However, these methods have disadvantages in learning efficiency and in their tendency toward localized solutions. In this paper, we propose a hybrid learning method that combines Profit Sharing (PS) and a genetic algorithm. We also report the effectiveness of our method through experiments on partially observable mazes.
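
The EPS credit assignment mentioned in the description can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function name `reinforce_episode`, the observation and action labels, and the choice to reinforce each distinct pair once per episode are all assumptions made for illustration; the description only states that the same reward is distributed to the state-action pairs in the episode.

```python
from collections import defaultdict


def reinforce_episode(weights, episode, reward):
    """Add the same reward to every distinct (observation, action) pair.

    Assumption (not from the record): a pair that occurs several times in
    one episode is reinforced only once.
    """
    for pair in set(episode):
        weights[pair] += reward
    return weights


# Hypothetical episode that ended at the goal. The pair ("o1", "right")
# occurs twice, as can happen when two states yield the same observation.
weights = defaultdict(float)
episode = [("o1", "right"), ("o2", "up"), ("o1", "right"), ("o3", "up")]
reinforce_episode(weights, episode, reward=1.0)
```

After the call, each of the three distinct pairs holds the same weight, regardless of where in the episode it appeared.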

Journal

IEEJ Transactions on Electronics, Information and Systems (電気学会論文誌Ｃ（電子・情報・システム部門誌）) 137 (12), 1591-1599, 2017

The Institute of Electrical Engineers of Japan (一般社団法人電気学会)

Details

CRID
1390001204609709696

NII Article ID
130006235400

NII Book ID
AN10065950

DOI
10.1541/ieejeiss.137.1591

ISSN
13488155
03854221

NDL BIB ID
028724878

Web Site
http://id.ndl.go.jp/bib/028724878
https://ndlsearch.ndl.go.jp/books/R000000004-I028724878
https://www.jstage.jst.go.jp/article/ieejeiss/137/12/137_1591/_pdf

Text Lang
ja

Data Source
- JaLC
- NDL Search
- Crossref
- CiNii Articles
- OpenAIRE

Abstract License Flag
Disallowed
