Evaluation of Speech Interval Detection and Face Orientation Estimation based on Sound Directions by Multiple Microphone Arrays and Human Positions

Bibliographic Information

Other Title
  • 複数のマイクロホンアレイによる音源方向情報と人位置情報に基づく音声区間検出および顔の向きの推定の評価
  • フクスウ ノ マイクロホンアレイ ニ ヨル オト ゲン ホウコウ ジョウホウ ト ヒト イチ ジョウホウ ニ モトズク オンセイ クカン ケンシュツ オヨビ カオ ノ ムキ ノ スイテイ ノ ヒョウカ

Search this article

Abstract

We developed a system for detecting the speech intervals of multiple speakers and estimating the face orientation during the detected speech intervals by combining information of sound directions from multiple microphone arrays and human positions. The developed system was evaluated in three conditions: individual utterances in different positions and orientations, simultaneous dialogues by multiple speakers, and moving sources. Evaluation results revealed that the proposed system could detect speech intervals with more than 90% accuracy, and face orientations with mean absolute errors around 20 degrees, in situations excluding the cases where all arrays are in the opposite direction to the speaker's face orientation.

Journal

References(10)*help

See more

Details 詳細情報について

Report a problem

Back to top