Confidence of agreement among multiple LVCSR models and model combination by SVM

Seiichi Nakagawa, Yasuhiro Kodama, Takehito Utsuro, Hiromitsu Nishizakil, Tomohiro Watanabel

doi:10.1109/icassp.2003.1198705

Confidence of agreement among multiple LVCSR models and model combination by SVM

Description

For many practical applications of speech recognition systems, it is quite desirable to have an estimate of confidence for each hypothesized word. Unlike previous works on confidence measures, we have proposed features for confidence measures that are extracted from outputs of more than one LVCSR models. For further analysis of the proposed confidence measure, this paper examines the correlation between each word's confidence and the word's features such as its part-of-speech and syllable length. We then apply SVM learning technique to the task of combining outputs of multiple LVCSR models, where, as features of SVM learning, information such as the pairs of the models which output the hypothesized word are useful for improving the word recognition rate. Experimental results show that the combination results achieve a relative word error reduction of up to 72 % against the best performing single model and that of up to 36 % against ROVER.

Journal

2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).

2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 1 I-16, 2003-11-21

IEEE

Details 詳細情報について

CRID

1872835442648691840
DOI

10.1109/icassp.2003.1198705
Data Source
- OpenAIRE

Confidence of agreement among multiple LVCSR models and model combination by SVM

Description

Journal

Details 詳細情報について

Export

Report a problem