複数のマイクロホンアレイによる音源方向情報と人位置情報に基づく音声区間検出および顔の向きの推定の評価

Bibliographic details

Alternative titles
  • Evaluation of Speech Interval Detection and Face Orientation Estimation based on Sound Directions by Multiple Microphone Arrays and Human Positions
  • フクスウ ノ マイクロホンアレイ ニ ヨル オト ゲン ホウコウ ジョウホウ ト ヒト イチ ジョウホウ ニ モトズク オンセイ クカン ケンシュツ オヨビ カオ ノ ムキ ノ スイテイ ノ ヒョウカ

Abstract

We developed a system that detects the speech intervals of multiple speakers and estimates face orientation during the detected intervals by combining sound direction information from multiple microphone arrays with human position information. The system was evaluated under three conditions: individual utterances at different positions and orientations, simultaneous dialogues among multiple speakers, and moving sources. The results show that the system detected speech intervals with more than 90% accuracy and estimated face orientations with mean absolute errors of around 20 degrees, except in cases where all arrays were located in the direction opposite to the speaker's face orientation.
