Voice-activated word processor with automatic learning for dynamic optimization of syllable-templates.

Togawa Furnio, Hakaridani Mitsuhiro, Iwahashi Hiroyuki, Ueda Torn

doi:10.1250/ast.10.133

A voice-activated word processor has been realized which provides text input by articulated, phrase-by-phrase Japanese speech, using an automatic learning algorithm to improve syllable recognition accuracy. A speaker-dependent, continuous-phrase speech recognizer incorporated into the word processor is capable of real-time recognition using 111 monosyllables as the basic recognition units. The recognizer is trained by new user to construct 590 reference syllable templates based on uttering a set of words, necessary for syllable template-matching process. Syllable templates are continually updated during use of the word processor through automatic learning. The automatic learning algorithm replaces low-accuracy syllable templates with new patterns extracted from the input speech. The replacement is sensitive to syllable context and is carried out based on recent and longer term history of the recognition accuracy of each syllable. Such learning information is derived from comparison between syllable recognition results and a user-confirmed character string of input phrase as correct, and is accumulated on a phrase-by-phrase basis. The automatic learning algorithm was tested in experiments using Japanese sentences read with pauses between phrases at approximately 4 to 5 syllables per second. The results, using eight speakers, show average syllable recognition accuracy of 82.5% with automatic learning, compared to 71.0% achieved without the learning. Further, the recognition accuracy is increased to 86.5% when the maximum number of syllable templates is increased to 2, 048; both template replacement and template addition are carried out until the maximum number of templates is reached.

Voice-activated word processor with automatic learning for dynamic optimization of syllable-templates.

Search this article

Description

Journal

Keywords

Details 詳細情報について

Export

Report a problem