Swarm reinforcement learning methods for problems with continuous state-action space

Hitoshi Iima, Yasuaki Kuroe, Kazuo Emoto

doi:10.1109/icsmc.2011.6083999

Swarm reinforcement learning methods for problems with continuous state-action space

Description

We recently proposed swarm reinforcement learning methods in which multiple sets of an agent and an environment are prepared and the agents learn not only by individually performing a usual reinforcement learning method but also by exchanging information among them. Q-learning method has been used as the individual learning in the methods, and they have been applied to a problem with discrete state-action space. In the real world, however, there are many problems which are formulated as ones with continuous state-action space. This paper proposes swarm reinforcement learning methods based on an actor-critic method in order to acquire optimal policies rapidly for problems with continuous state-action space. The proposed methods are applied to a biped robot control problem, and their performance is examined through numerical experiments.

Journal

2011 IEEE International Conference on Systems, Man, and Cybernetics

2011 IEEE International Conference on Systems, Man, and Cybernetics 2173-2180, 2011-10

IEEE

Citations (1)*help

References(11)*help

Related Projects

Details 詳細情報について

CRID

1360285710242980224
DOI

10.1109/icsmc.2011.6083999
Web Site

http://xplorestaging.ieee.org/ielx5/6070513/6083622/06083999.pdf?arnumber=6083999
Article Type

journal article
Data Source
- Crossref
- KAKEN
- OpenAIRE

Swarm reinforcement learning methods for problems with continuous state-action space

Description

Journal

Citations (1)*help

References(11)*help

Related Projects

Details 詳細情報について

Export

Report a problem