A Case-based Reward Function Design for Reinforcement Learning-based Pure Pursuit Hybrid Controller
説明
This paper presents an innovative approach to enhancing the Pure Pursuit algorithm for path tracking in autonomous vehicles by integrating Reinforcement Learning and curvature information. Traditional Pure Pursuit algorithms, while effective in low-speed scenarios, often require extensive manual tuning of the look-ahead distance to maintain tracking accuracy at varying speeds and complex paths. To address these limitations, we designed an RL-based pure pursuit controller incorporating future curvature into the state space and reward function to enhance learning a proper tracking policy at higher speeds. The controller is trained and evaluated in the CARLA simulator, demonstrating improved performance in terms of path-tracking accuracy and stability across different speeds and path complexities. By comparing the controller which considered curvature improvement with the original one, our results show that the improved method can achieve lower lateral deviation and lateral acceleration while maintaining almost the same average speed.
This paper presents an innovative approach to enhancing the Pure Pursuit algorithm for path tracking in autonomous vehicles by integrating Reinforcement Learning and curvature information. Traditional Pure Pursuit algorithms, while effective in low-speed scenarios, often require extensive manual tuning of the look-ahead distance to maintain tracking accuracy at varying speeds and complex paths. To address these limitations, we designed an RL-based pure pursuit controller incorporating future curvature into the state space and reward function to enhance learning a proper tracking policy at higher speeds. The controller is trained and evaluated in the CARLA simulator, demonstrating improved performance in terms of path-tracking accuracy and stability across different speeds and path complexities. By comparing the controller which considered curvature improvement with the original one, our results show that the improved method can achieve lower lateral deviation and lateral acceleration while maintaining almost the same average speed.
収録刊行物
-
- 第32回マルチメディア通信と分散処理ワークショップ論文集
-
第32回マルチメディア通信と分散処理ワークショップ論文集 71-77, 2024-10-23
情報処理学会
- Tweet
キーワード
詳細情報 詳細情報について
-
- CRID
- 1050020519548271616
-
- 本文言語コード
- en
-
- 資料種別
- conference paper
-
- データソース種別
-
- IRDB