-
- Yashima Ryota
- Tohoku University
-
- Yamaguchi Akihiko
- Tohoku University
-
- Hashimoto Koichi
- Tohoku University
Bibliographic Information
- Other Title
-
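- 複雑なダイナミクス構造におけるモデルベース型強化学習のデバッグ手法 (English translation: A Debugging Method for Model-Based Reinforcement Learning in Complex Dynamics Structures)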
Description
<p>In this study, we explore a systematic debugging method for model-based reinforcement learning that uses a library of skills. When the performance of model-based reinforcement learning (learning speed or the quality of the learned behavior) is insufficient, identifying the cause is difficult, especially when the dynamics are complicated, as in liquid pouring. In our previous work, we introduced a library of skills for reinforcement learning of such complicated tasks. We believe that a skill library is also useful for investigating performance issues, since each subset of skills can be tested separately. Our goal is to develop a systematic debugging method for reinforcement learning based on this idea. This paper reports a preliminary development toward this goal, in which we repeatedly increase and decrease the complexity of a subtask to make debugging easier, as in curriculum learning, until we obtain sufficient results on the original task. We conducted simulation experiments on liquid pouring to evaluate this approach. The results show a performance improvement.</p>
Journal
-
- The Proceedings of JSME annual Conference on Robotics and Mechatronics (Robomec)
-
The Proceedings of JSME annual Conference on Robotics and Mechatronics (Robomec) 2021 (0), 1A1-F05-, 2021
The Japan Society of Mechanical Engineers
Details
-
- CRID
- 1390290537432796032
-
- NII Article ID
- 130008134842
-
- ISSN
- 2424-3124
-
- Text Lang
- ja
-
- Data Source
-
- JaLC
- Crossref
- CiNii Articles
- OpenAIRE
-
- Abstract License Flag
- Disallowed