Outdoor Acoustic Event Identification with DNN Using a Quadrotor-Embedded Microphone Array
-
- Sugiyama Osamu
- Preemptive Medicine & Lifestyle-Related Disease Research Center, Kyoto University Hospital
-
- Uemura Satoshi
- Graduate School of Information Science and Engineering, Tokyo Institute of Technology
-
- Nagamine Akihide
- Department of Electrical and Electronic Engineering, School of Engineering, Tokyo Institute of Technology
-
- Kojima Ryosuke
- Graduate School of Information Science and Engineering, Tokyo Institute of Technology
-
- Nakamura Keisuke
- Honda Research Institute Japan Co., Ltd.
-
- Nakadai Kazuhiro
- Graduate School of Information Science and Engineering, Tokyo Institute of Technology Honda Research Institute Japan Co., Ltd.
この論文をさがす
説明
<p>This paper addresses Acoustic Event Identification (AEI) of acoustic signals observed with a microphone array embedded in a quadrotor that is flying in a noisy outdoor environment. In such an environment, noise generated by rotors, wind, and other sound sources is a big problem. To solve this, we propose the use of a combination of two approaches that have recently been introduced: Sound Source Separation (SSS) and Sound Source Identification (SSI). SSS improves the Signal-to-Noise Ratio (SNR) of the input sound, and SSI is then performed on the SNR-improved sound. Two SSS methods are investigated. One is a single channel algorithm, Robust Principal Component Analysis (RPCA), and the other is Geometric High-order Decorrelation-based Source Separation (GHDSS-AS), known as a multichannel method. For SSI, we investigate two types of deep neural networks namely Stacked denoising Autoencoder (SdA) and Convolutional Neural Network (CNN), which have been extensively studied as highly-performant approaches in the fields of automatic speech recognition and visual object recognition. Preliminary experiments have showed the effectiveness of the proposed approaches, a combination of GHDSS-AS and CNN in particular. This combination correctly identified over 80% of sounds in an 8-class sound classification recorded by a hovering quadrotor. In addition, the CNN identifier that was implemented could be handled even with a low-end CPU by measuring the prediction time.</p>
収録刊行物
-
- Journal of Robotics and Mechatronics
-
Journal of Robotics and Mechatronics 29 (1), 188-197, 2017-02-20
富士技術出版株式会社
- Tweet
キーワード
詳細情報 詳細情報について
-
- CRID
- 1390282763070436864
-
- NII論文ID
- 130007519829
-
- NII書誌ID
- AA10809998
-
- ISSN
- 18838049
- 09153942
-
- NDL書誌ID
- 027998535
-
- 本文言語コード
- en
-
- 資料種別
- journal article
-
- データソース種別
-
- JaLC
- NDLサーチ
- Crossref
- CiNii Articles
- KAKEN
- OpenAIRE
-
- 抄録ライセンスフラグ
- 使用不可