Swarm reinforcement learning methods for problems with continuous state-action space

Hitoshi Iima, Yasuaki Kuroe, Kazuo Emoto

doi:10.1109/icsmc.2011.6083999

Swarm reinforcement learning methods for problems with continuous state-action space

説明

We recently proposed swarm reinforcement learning methods in which multiple sets of an agent and an environment are prepared and the agents learn not only by individually performing a usual reinforcement learning method but also by exchanging information among them. Q-learning method has been used as the individual learning in the methods, and they have been applied to a problem with discrete state-action space. In the real world, however, there are many problems which are formulated as ones with continuous state-action space. This paper proposes swarm reinforcement learning methods based on an actor-critic method in order to acquire optimal policies rapidly for problems with continuous state-action space. The proposed methods are applied to a biped robot control problem, and their performance is examined through numerical experiments.

収録刊行物

2011 IEEE International Conference on Systems, Man, and Cybernetics

2011 IEEE International Conference on Systems, Man, and Cybernetics 2173-2180, 2011-10

IEEE

Swarm reinforcement learning methods for problems with continuous state-action space

説明

収録刊行物

被引用文献 (1)*注記

参考文献 (11)*注記

関連プロジェクト

詳細情報詳細情報について

書き出し

問題の指摘

Swarm reinforcement learning methods for problems with continuous state-action space

説明

収録刊行物

被引用文献 (1)*注記

参考文献 (11)*注記

関連プロジェクト

詳細情報 詳細情報について

書き出し

問題の指摘

詳細情報詳細情報について