Learning Quadcopter Maneuvers with Concurrent Methods of Policy Optimization

Huang Pei-Hua, Hasegawa Osamu

doi:10.20965/jaciii.2017.p0639

Learning Quadcopter Maneuvers with Concurrent Methods of Policy Optimization

DOI Web Site Web Site 参考文献18件

Huang Pei-Hua

Tokyo Institute of Technology
Hasegawa Osamu

Tokyo Institute of Technology

この論文をさがす

抄録

<p>This study presents an aerial robotic application of deep reinforcement learning that imparts an asynchronous learning framework and trust region policy optimization to a simulated quad-rotor helicopter (quadcopter) environment. In particular, we optimized a control policy asynchronously through interaction with concurrent instances of the environment. The control system was benchmarked and extended with examples to tackle continuous state-action tasks for the quadcoptor: hovering control and balancing an inverted pole. Performing these maneuvers required continuous actions for sensitive control of small acceleration changes of the quadcoptor, thereby maximizing the scalar reward of the defined tasks. The simulation results demonstrated an enhancement of the learning speed and reliability for the tasks.</p>

収録刊行物

Journal of Advanced Computational Intelligence and Intelligent Informatics

Journal of Advanced Computational Intelligence and Intelligent Informatics 21 (4), 639-649, 2017-07-20

富士技術出版株式会社

参考文献 (18)*注記

詳細情報詳細情報について

CRID

1390001288091294464
NII論文ID

130007520151
NII書誌ID

AA12042502
DOI

10.20965/jaciii.2017.p0639
ISSN

18838014

13430130
NDL書誌ID

028357027
Web Site

http://id.ndl.go.jp/bib/028357027

https://ndlsearch.ndl.go.jp/books/R000000004-I028357027

https://www.fujipress.jp/main/wp-content/themes/Fujipress/phyosetsu.php?ppno=JACII002100040005
本文言語コード

en
データソース種別
- JaLC
- NDL
- Crossref
- CiNii Articles
抄録ライセンスフラグ
使用不可

書き出し

問題の指摘

ページトップへ

Learning Quadcopter Maneuvers with Concurrent Methods of Policy Optimization

この論文をさがす

抄録

収録刊行物

参考文献 (18)*注記

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

詳細情報詳細情報について