Evolution of Learning Parameters in a Team of Mobile Agents

説明

Most of everyday life environments are unknown and dynamic. Therefore, the artificial agents living in such environments must adapt their policy based on the environment conditions. In this paper, we consider a team of mobile agents that learns to survive by capturing the active battery packs. In our method, evolution considered metaparameters of an actor-critic reinforcement learning algorithm. Results show that after some generations the agents were able to survive and increase the energy level. In addition, the evolved metaparameters helped the agent to adapt much faster during the first stage of life and find an important relation between exploration-exploitation and energy level

収録刊行物

詳細情報 詳細情報について

問題の指摘

ページトップへ