Evaluation of Speech Interval Detection and Face Orientation Estimation based on Sound Directions by Multiple Microphone Arrays and Human Positions
-
- Ishi Carlos T.
- ATR Intelligent Robotics and Communication Labs.
-
- Even Jani
- ATR Intelligent Robotics and Communication Labs.
-
- Hagita Norihiro
- ATR Intelligent Robotics and Communication Labs.
Bibliographic Information
- Other Title
-
- 複数のマイクロホンアレイによる音源方向情報と人位置情報に基づく音声区間検出および顔の向きの推定の評価
- フクスウ ノ マイクロホンアレイ ニ ヨル オト ゲン ホウコウ ジョウホウ ト ヒト イチ ジョウホウ ニ モトズク オンセイ クカン ケンシュツ オヨビ カオ ノ ムキ ノ スイテイ ノ ヒョウカ
Search this article
Abstract
We developed a system for detecting the speech intervals of multiple speakers and estimating the face orientation during the detected speech intervals by combining information of sound directions from multiple microphone arrays and human positions. The developed system was evaluated in three conditions: individual utterances in different positions and orientations, simultaneous dialogues by multiple speakers, and moving sources. Evaluation results revealed that the proposed system could detect speech intervals with more than 90% accuracy, and face orientations with mean absolute errors around 20 degrees, in situations excluding the cases where all arrays are in the opposite direction to the speaker's face orientation.
Journal
-
- Journal of the Robotics Society of Japan
-
Journal of the Robotics Society of Japan 34 (3), 199-204, 2016
The Robotics Society of Japan
- Tweet
Keywords
Details 詳細情報について
-
- CRID
- 1390282679706292992
-
- NII Article ID
- 130005157569
-
- NII Book ID
- AN00141189
-
- ISSN
- 18847145
- 02891824
-
- NDL BIB ID
- 027303828
-
- Text Lang
- ja
-
- Data Source
-
- JaLC
- NDL
- Crossref
- CiNii Articles
-
- Abstract License Flag
- Disallowed