Challenges for the policy representation when applying reinforcement learning in robotics

Kormushev, P, Calinon, S, Ugurlu, B, Caldwell, DG

doi:10.1109/ijcnn.2012.6252758

A summary of the state-of-the-art reinforcement learning in robotics is given, in terms of both algorithms and policy representations. Numerous challenges faced by the policy representation in robotics are identified. Two recent examples for application of reinforcement learning to robots are described: pancake flipping task and bipedal walking energy minimization task. In both examples, a state-of-the-art Expectation-Maximization-based reinforcement learning algorithm is used, but different policy representations are proposed and evaluated for each task. The two proposed policy representations offer viable solutions to four rarely-addressed challenges in policy representations: correlations, adaptability, multi-resolution, and globality. Both the successes and the practical difficulties encountered in these examples are discussed.

Challenges for the policy representation when applying reinforcement learning in robotics

説明

収録刊行物

詳細情報詳細情報について

書き出し

問題の指摘

Challenges for the policy representation when applying reinforcement learning in robotics

説明

収録刊行物

詳細情報 詳細情報について

書き出し

問題の指摘

詳細情報詳細情報について