An Extension of the Rational Policy Making algorithm to Continuous State Spaces
-
- Miyazaki Kazuteru
- Department of Assessment and Research for Degree Awarding, National Institution for Academic Degrees and University Evaluation
-
- Kimura Hajime
- Department of Marine Systems Engineering, Kyushu University
-
- Kobayashi Shigenobu
- Graduate School of Interdisciplinary Science and Engineering, Tokyo Institute of Technology
Bibliographic Information
- Other Title
-
- 合理的政策形成アルゴリズムの連続値入力への拡張
- ゴウリテキ セイサク ケイセイ アルゴリズム ノ レンゾクチ ニュウリョク エノ カクチョウ
Search this article
Abstract
Reinforcement Learning is a kind of machine learning. We know Profit Sharing, the Rational Policy Making algorithm (RPM), the Penalty Avoiding Rational Policy Making algorithm and PS-r* to guarantee the rationality in a typical class of the Partially Observable Markov Decision Processes. However they cannot treat continuous state spaces. In this paper, we present a solution to adapt them in continuous state spaces. We give RPM a mechanism to treat continuous state spaces in the environment that has the same type of a reward. We show the effectiveness of the proposed method in numerical examples.
Journal
-
- Transactions of the Japanese Society for Artificial Intelligence
-
Transactions of the Japanese Society for Artificial Intelligence 22 (3), 332-341, 2007
The Japanese Society for Artificial Intelligence
- Tweet
Details 詳細情報について
-
- CRID
- 1390282680085982208
-
- NII Article ID
- 10022007639
-
- NII Book ID
- AA11579226
-
- BIBCODE
- 2007TJSAI..22..332M
-
- ISSN
- 13468030
- 13460714
-
- NDL BIB ID
- 9603992
-
- Text Lang
- ja
-
- Data Source
-
- JaLC
- NDL
- Crossref
- CiNii Articles
- KAKEN
-
- Abstract License Flag
- Disallowed