Visual Recognition of Spoken Words Using Optical Flow

Bibliographic Information

Other Title
  • 動画像のOptical Flowを用いた発声単語認識システム
  • ドウガゾウ ノ Optical Flow オ モチイタ ハッセイ タンゴ ニンシキ システム

Search this article

Abstract

This paper describes an automatic vision-based spoken word recognition system that utilizes, instead of audio signal, visual motion signal which is obtained from motion pictures taken of a region around the mouth during speech. Motion information on each pixel in the input time-series imagery was obtained by computation of optical flow, and feature values representing a spatial configuration of pixel-wise velocities were extracted for each frame image. Both starting and ending points of time for each spoken word were defined using the velocity feature values, and a high dimensional feature vector was obtained to indicate time variation of the velocity distribution within the period of utterance. As a preliminary performance evaluation of the proposed feature in spoken word recognition, discrimination test of five spoken words including A-RI-GA-TO-U and KO-N-NI-CHI-WA was conducted, and fairly promising results were achieved.

Journal

Related Projects

See more

Details 詳細情報について

Report a problem

Back to top