Multiple index combination for Japanese spoken term detection with optimum index selection based on OOV-region classifier

Naoyuki Kanda, Katsutoshi Itoyama, Hiroshi G. Okuno

doi:10.1109/icassp.2013.6639332

Multiple index combination for Japanese spoken term detection with optimum index selection based on OOV-region classifier

DOI PDF 参考文献18件オープンアクセス

説明

In this paper, a novel index combination method for spoken term detection is proposed. In our method, outputs from four different recognizers (word, syllable, word-syllable, and fragment recognizer) are combined into one confusion network. A novel index-selection method for the multiple index-combination method is then used to suppress the increase of the index size. Two methods are proposed to reduce index size: (1) arc selection and (2) unit selection, both of which are based on an OOV-region classifier score. Experimental results with 39 hours of Japanese lecture recordings showed that the index-selection method achieved a 22% reduction of index size of the best confusion network while maintaining its high accuracy. Compared with the best phoneme-based index from a single recognizer, the proposed method achieved a 25.0% and 14.8% relative error reduction for IV and OOV queries without increasing the index size.

収録刊行物

2013 IEEE International Conference on Acoustics, Speech and Signal Processing

2013 IEEE International Conference on Acoustics, Speech and Signal Processing 8540-8544, 2013-05

IEEE

参考文献 (18)*注記

書き出し

問題の指摘

ページトップへ

Multiple index combination for Japanese spoken term detection with optimum index selection based on OOV-region classifier

説明

収録刊行物

参考文献 (18)*注記

関連プロジェクト

詳細情報詳細情報について

書き出し

問題の指摘

Multiple index combination for Japanese spoken term detection with optimum index selection based on OOV-region classifier

説明

収録刊行物

参考文献 (18)*注記

関連プロジェクト

詳細情報 詳細情報について

書き出し

問題の指摘

詳細情報詳細情報について