Policy Gradient Reinforcement Learning for Membership Functions in Policy Represented by Fuzzy Rules: Application to Simulations on Speed Control of an Automobile

Bibliographic Information

Other Title
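  • ファジィ制御ルールにより表現された方策を持つ方策勾配法: 自動車の速度制御問題におけるメンバシップ関数の学習 (Policy Gradient Method with a Policy Represented by Fuzzy Control Rules: Learning Membership Functions in an Automobile Speed Control Problem)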

Abstract

A method that fuses fuzzy inference with policy gradient reinforcement learning has been proposed. It directly learns the parameters of a policy function represented by fuzzy rules with weights and membership functions, so as to maximize the expected reward per episode. An earlier study applied this method to an automobile speed control task and obtained correct policies by learning the rule weights; some of these policies control the automobile's speed appropriately. However, the membership functions, which quantify fuzzy concepts, were designed from human knowledge. In this research, we therefore report experimental results showing that the fusion method can also learn membership functions represented by a layered neural network.
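
The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and rests on several assumptions: the membership functions are Gaussians with learnable centers (the abstract uses a layered neural network instead), the policy is a softmax over rule-weighted action preferences, the update is a plain REINFORCE-style step, and the three-action set and all function names are hypothetical.

```python
import math

# Hypothetical action set: brake, hold, accelerate.
ACTIONS = (-1.0, 0.0, 1.0)

def memberships(x, centers, width):
    """Gaussian membership degree of speed error x in each fuzzy set (assumed form)."""
    return [math.exp(-((x - c) ** 2) / (2.0 * width ** 2)) for c in centers]

def policy(x, centers, weights, width):
    """Softmax over preferences h_a = sum_i mu_i(x) * weights[i][a]."""
    mu = memberships(x, centers, width)
    prefs = [sum(mu[i] * weights[i][a] for i in range(len(mu)))
             for a in range(len(ACTIONS))]
    top = max(prefs)  # subtract the maximum for numerical stability
    exps = [math.exp(p - top) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps], mu

def grad_log_policy(x, action, centers, weights, width):
    """Gradient of log pi(action | x) w.r.t. membership centers and rule weights."""
    probs, mu = policy(x, centers, weights, width)
    delta = [(1.0 if a == action else 0.0) - probs[a]
             for a in range(len(ACTIONS))]
    g_weights = [[mu[i] * delta[a] for a in range(len(ACTIONS))]
                 for i in range(len(centers))]
    g_centers = [mu[i] * (x - centers[i]) / width ** 2
                 * sum(delta[a] * weights[i][a] for a in range(len(ACTIONS)))
                 for i in range(len(centers))]
    return g_centers, g_weights

def reinforce_step(episode, total_reward, centers, weights, width, lr=0.05):
    """One in-place update from an episode of (speed_error, action_index) pairs."""
    acc_c = [0.0] * len(centers)
    acc_w = [[0.0] * len(ACTIONS) for _ in centers]
    # Accumulate gradients first so every term uses the same parameters.
    for x, action in episode:
        g_c, g_w = grad_log_policy(x, action, centers, weights, width)
        for i in range(len(centers)):
            acc_c[i] += g_c[i]
            for a in range(len(ACTIONS)):
                acc_w[i][a] += g_w[i][a]
    for i in range(len(centers)):
        centers[i] += lr * total_reward * acc_c[i]
        for a in range(len(ACTIONS)):
            weights[i][a] += lr * total_reward * acc_w[i][a]
```

In this sketch both the rule weights and the membership centers receive gradient signal from the same episode reward, which is the sense in which membership functions are learned rather than hand-designed. Replacing the Gaussian with a small neural network would change only `memberships` and the corresponding derivative term in `grad_log_policy`.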

Details

  • CRID
    1390299086443515264
  • DOI
    10.14864/fss.39.0_422
  • Text Lang
    ja
  • Data Source
    • JaLC
  • Abstract License Flag
    Disallowed
