Particle Filter Design Based on Reinforcement Learning and Its Application to Mobile Robot Localization

  • YOSHIMURA Ryota
    Department of Aeronautics and Astronautics, Kyoto University; Regional Technology Support Division, Tokyo Metropolitan Industrial Technology Research Institute
  • MARUTA Ichiro
    Department of Aeronautics and Astronautics, Kyoto University
  • FUJIMOTO Kenji
    Department of Aeronautics and Astronautics, Kyoto University
  • SATO Ken
    Digitalization Promotion Section, Tokyo Metropolitan Industrial Technology Research Institute
  • KOBAYASHI Yusuke
    Digitalization Promotion Section, Tokyo Metropolitan Industrial Technology Research Institute

Description

<p>Particle filters are widely used for state estimation in nonlinear, non-Gaussian systems. Their performance depends on the system and measurement models, which the user must design for each target system. This paper proposes a novel method for designing these models. The method is a numerical optimization in which the particle filter design process is cast in the framework of reinforcement learning by assigning the randomness contained in both models to the reinforcement learning policy. Estimation by the particle filter is performed repeatedly, and the parameters that determine both models are gradually updated according to the estimation results. The advantage of the method is that it can optimize various objective functions, such as the estimation accuracy of the particle filter, the variance of the particles, the likelihood of the parameters, and a regularization term on the parameters. We derive conditions that guarantee that the optimization converges with probability 1. Furthermore, to show that the proposed method can be applied to practical-scale problems, we design a particle filter for mobile robot localization, an essential technology for autonomous navigation. Numerical simulations demonstrate that the proposed method improves localization accuracy compared with the conventional method.</p>
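
To make the role of the two models concrete, the following is a minimal sketch of a generic bootstrap particle filter, not the authors' method. It assumes a one-dimensional random-walk system model and a Gaussian measurement model; the names `particle_filter`, `mean_squared_error`, `q_std`, and `r_std` are hypothetical and chosen only for illustration. The two noise parameters stand in for the kind of design parameters the abstract describes, and the error function is one example of an objective that a design procedure could evaluate after each run.

```python
import math
import random


def particle_filter(measurements, n_particles, q_std, r_std, rng):
    """Bootstrap particle filter for a 1-D random-walk state (illustrative).

    q_std: noise std of the assumed system model (a design parameter)
    r_std: noise std of the assumed measurement model (a design parameter)
    rng:   a random.Random instance, so runs are reproducible
    """
    # Initial particles drawn from a standard normal prior (an assumption).
    particles = [rng.gauss(0.0, 1.0) for _ in range(n_particles)]
    estimates = []
    for y in measurements:
        # Predict: sample each particle through the system model.
        particles = [x + rng.gauss(0.0, q_std) for x in particles]
        # Weight: unnormalized Gaussian measurement likelihood p(y | x).
        weights = [math.exp(-0.5 * ((y - x) / r_std) ** 2) for x in particles]
        total = sum(weights)
        if total == 0.0:
            # All likelihoods underflowed; fall back to uniform weights.
            weights = [1.0] * n_particles
            total = float(n_particles)
        weights = [w / total for w in weights]
        # Estimate: weighted mean of the particles.
        estimates.append(sum(w * x for w, x in zip(weights, particles)))
        # Resample: multinomial resampling in proportion to the weights.
        particles = rng.choices(particles, weights=weights, k=n_particles)
    return estimates


def mean_squared_error(estimates, truth):
    """Estimation-accuracy objective: mean squared error against the truth."""
    return sum((e - t) ** 2 for e, t in zip(estimates, truth)) / len(truth)
```

Running `particle_filter` on the same measurement sequence with different `q_std` and `r_std` values and comparing `mean_squared_error` shows how estimation accuracy depends on the model design. The paper replaces this kind of manual comparison with a reinforcement-learning update of the parameters, which this sketch does not reproduce.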
