Particle Filter Design Based on Reinforcement Learning and Its Application to Mobile Robot Localization
-
- YOSHIMURA Ryota
- Department of Aeronautics and Astronautics, Kyoto University; Regional Technology Support Division, Tokyo Metropolitan Industrial Technology Research Institute
-
- MARUTA Ichiro
- Department of Aeronautics and Astronautics, Kyoto University
-
- FUJIMOTO Kenji
- Department of Aeronautics and Astronautics, Kyoto University
-
- SATO Ken
- Digitalization Promotion Section, Tokyo Metropolitan Industrial Technology Research Institute
-
- KOBAYASHI Yusuke
- Digitalization Promotion Section, Tokyo Metropolitan Industrial Technology Research Institute
Description
Particle filters are widely used for state estimation in nonlinear, non-Gaussian systems. Their performance depends on the system and measurement models, which the user must design for each target system. This paper proposes a novel method for designing these models. The method is a numerical optimization in which the particle filter design process is cast in the framework of reinforcement learning by assigning the randomness contained in both models to the reinforcement learning policy. The particle filter is run repeatedly, and the parameters that determine both models are updated gradually according to the estimation results. An advantage of the method is that it can optimize a variety of objective functions, such as the estimation accuracy of the particle filter, the variance of the particles, the likelihood of the parameters, and a regularization term on the parameters. We derive conditions that guarantee that the optimization converges with probability 1. Furthermore, to show that the proposed method applies to practical-scale problems, we design a particle filter for mobile robot localization, an essential technology for autonomous navigation. Numerical simulations demonstrate that the proposed method improves localization accuracy over the conventional method.
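For readers unfamiliar with the estimator the abstract refers to, the following is a minimal sketch of a generic bootstrap particle filter on a toy one-dimensional random-walk model. It is not the authors' method or their robot localization setup. The function names (`particle_filter`, `simulate`, `rmse`), the Gaussian noise models, the prior, and all default values are assumptions made for illustration. The parameters `process_std` and `measurement_std` play the role of the user-designed system and measurement models whose tuning the paper addresses.

```python
import math
import random


def particle_filter(measurements, n_particles=500, process_std=0.5,
                    measurement_std=1.0, seed=0):
    """Bootstrap particle filter for a 1-D random-walk state (toy model).

    process_std and measurement_std are hypothetical design parameters
    standing in for the system and measurement models.
    """
    rng = random.Random(seed)
    # Initialise particles from a broad prior (an assumption of this sketch).
    particles = [rng.gauss(0.0, 5.0) for _ in range(n_particles)]
    estimates = []
    for y in measurements:
        # Predict: propagate each particle through the system model x' = x + noise.
        particles = [x + rng.gauss(0.0, process_std) for x in particles]
        # Update: weight each particle by the Gaussian measurement likelihood.
        weights = [math.exp(-0.5 * ((y - x) / measurement_std) ** 2)
                   for x in particles]
        total = sum(weights)
        if total == 0.0:
            # All likelihoods underflowed; fall back to uniform weights.
            weights = [1.0 / n_particles] * n_particles
        else:
            weights = [w / total for w in weights]
        # Estimate: weighted mean of the particles.
        estimates.append(sum(w * x for w, x in zip(weights, particles)))
        # Resample (multinomial) to concentrate particles in likely regions.
        particles = rng.choices(particles, weights=weights, k=n_particles)
    return estimates


def simulate(n_steps=50, process_std=0.5, measurement_std=1.0, seed=1):
    """Generate a true trajectory and noisy measurements from the same toy model."""
    rng = random.Random(seed)
    x, states, measurements = 0.0, [], []
    for _ in range(n_steps):
        x += rng.gauss(0.0, process_std)
        states.append(x)
        measurements.append(x + rng.gauss(0.0, measurement_std))
    return states, measurements


def rmse(a, b):
    """Root-mean-square error between two equal-length sequences."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)) / len(a))
```

Calling `simulate()` and passing the returned measurements to `particle_filter` with different values of `process_std` and `measurement_std`, then comparing `rmse` against the true states, gives a rough feel for how strongly the estimation accuracy depends on these model choices.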
Journal
-
- IEICE Transactions on Information and Systems
-
IEICE Transactions on Information and Systems E105.D (5), 1010-1023, 2022-05-01
The Institute of Electronics, Information and Communication Engineers (IEICE)
Details
-
- CRID
- 1390291932623101184
-
- ISSN
- 1745-1361
- 0916-8532
-
- Text language code
- en
-
- Data source type
-
- JaLC
- Crossref
- OpenAIRE
-
- Abstract license flag
- Not permitted