Policy Gradient Reinforcement Learning for Membership Functions in Policy Represented by Fuzzy Rules: Application to Simulations on Speed Control of an Automobile
-
- Kojima Reo
- Graduate School of Tokyo Denki University
-
- Ishihara Seiji
- Tokyo Denki University
-
- Ichige Shun
- University of Tsukuba
-
- Igarashi Harukazu
- Shibaura Institute of Technology
Bibliographic Information
- Other Title
-
- ファジィ制御ルールにより表現された方策を持つ方策勾配法: 自動車の速度制御問題におけるメンバシップ関数の学習
Abstract
<p>A method of a fusion of fuzzy inference and policy gradient reinforcement learning has been proposed that directly learns, as maximizes the expected value of the reward per episode, parameters in a policy function represented by fuzzy rules with weights and membership functions. A study has applied this method to a task of speed control of an automobile and has obtained correct policies with learned weights of rules, some of which control speed of the automobile appropriately. However, membership functions that quantify fuzzy concepts were designed based on human knowledge. Therefore, in this research, we show the result of experiments that the fusion method can learn the membership functions represented by a layered neural network.</p>
Journal
-
- Proceedings of the Fuzzy System Symposium
-
Proceedings of the Fuzzy System Symposium 39 (0), 422-427, 2023
Japan Society for Fuzzy Theory and Intelligent Informatics
- Tweet
Details 詳細情報について
-
- CRID
- 1390299086443515264
-
- Text Lang
- ja
-
- Data Source
-
- JaLC
-
- Abstract License Flag
- Disallowed