- 【Updated on May 12, 2025】 Integration of CiNii Dissertations and CiNii Books into CiNii Research
- Trial version of CiNii Research Knowledge Graph Search feature is available on CiNii Labs
- Suspension and deletion of data provided by Nikkei BP
- Regarding the recording of “Research Data” and “Evidence Data”
Performance of LQ-learning in POMDP Environments
-
- Lee Haeyeon
- Dept. of Elec. & Comm. Eng., School of Eng., Tohoku Univ.
-
- Kamaya Hiroyuki
- Dept. of Elec. Eng., Hachinohe National College of Technology
-
- Abe Kenich
- Dept. of Elec. & Comm. Eng., School of Eng., Tohoku Univ.
Description
In this paper, we propose a new type of LQ-learning to solve POMDP. In the POMDP environment, the agent cannot observe the environment directly. In the LQ-learning, in order to dicriminate partially observed states, the agent attaches label to each observation which perceived as the same ones. Unlike our previous LQ-learning, we make preparations of knowledge about the environment in advance. The knowledge is automatically acquired by Kohenen’s Self-Organizing Map (SOM), which provides the knowledge about state transitions to the agent. Then, LQ-learning agent attaches labels to observations with reference to a map obtained by SOM.
Journal
-
- SICE Annual Conference Program and Abstracts
-
SICE Annual Conference Program and Abstracts 2002 (0), 174-174, 2002
The Society of Instrument and Control Engineers
- Tweet
Keywords
Details 詳細情報について
-
- CRID
- 1390282680561053568
-
- NII Article ID
- 130006960136
-
- Text Lang
- en
-
- Data Source
-
- JaLC
- CiNii Articles
-
- Abstract License Flag
- Disallowed