書誌事項
- タイトル別名
-
- Real-Time Human Tracking by Audio-Visual Integration for Humanoids-Integration of Active Audition and Face Recognition-
- ヒューマノイド オ タイショウ ニ シタ シチョウカク トウゴウ ニ ヨル ジツジカン ジンブツ ツイセキ アクティブオーディション ト カオ ニンシキ ノ トウゴウ
- —Integration of Active Audition and Face Recognition—
- ―アクティブオーディションと顔認識の統合―
この論文をさがす
説明
This paper describes a real-time human tracking system by audio-visual integrtation for the humanoid SIG. An essential idea for real-time and robust tracking is hierarchical integration of multi-modal information. The system creates three kinds of streams - auditory, visual and associated streams. An auditory stream with sound source direction is formed as temporal series of events from audition module which localizes multiple sound sources and cancels motor noise from a pair of microphones. A visual stream with a face ID and its 3D-position is formed as temporal series of events from vision module by combining face detection, face identification and face localization by stereo vision. Auditory and visual streams are associated into an associated stream, a higher level representation according to their proximity. Because the associated stream disambiguates parcially missing information in auditory or visual streams, “focus-of-attention” control of SIG works well enough to robust human tracking. These processes are executed in real-time with the delay of 200 msec using off-the-shelf PCs distributed via TCP/IP. As a result, robust human tracking is attained even when the person is visually occluded and simultaneous speeches occur.
収録刊行物
-
- 日本ロボット学会誌
-
日本ロボット学会誌 21 (5), 517-525, 2003
一般社団法人 日本ロボット学会
- Tweet
キーワード
詳細情報 詳細情報について
-
- CRID
- 1390001204725392384
-
- NII論文ID
- 10011243291
-
- NII書誌ID
- AN00141189
-
- ISSN
- 18847145
- 02891824
-
- NDL書誌ID
- 6646149
-
- 本文言語コード
- ja
-
- データソース種別
-
- JaLC
- NDL
- Crossref
- CiNii Articles
-
- 抄録ライセンスフラグ
- 使用不可