- 【Updated on May 12, 2025】 Integration of CiNii Dissertations and CiNii Books into CiNii Research
- Trial version of CiNii Research Knowledge Graph Search feature is available on CiNii Labs
- Suspension and deletion of data provided by Nikkei BP
- Regarding the recording of “Research Data” and “Evidence Data”
Simultaneous learning of situation classification based on rewards and behavior selection based on the situation
Description
This paper describes a system with which a cognitive agent learns the way of abstraction and the policy of behavior selection simultaneously. We call the system situation transition network system (STNS). The system extracts situations and maintains them dynamically in the continuous state space on the basis of rewards from the environment. In this way, the system learns the way of abstraction in a dynamic environment. At the same time, the system stores results of transitions between situations and constructs a network of situations. This network is used for partial planning. At a point of time in the learning process, the system selects a behavior according to the partial plan. Because the planning is performed on a network of the abstracted situations, the agent with STNS does not have to deliberate details in planning. Furthermore, the agent can make a plan even on the early stage of learning because the planning is partial. Owing to the simultaneous learning with task executions the agent can adapt to the current task. The results of computer simulations are given.
Journal
-
- Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96
-
Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96 3 1510-1517, 2002-12-24
IEEE