- 【Updated on May 12, 2025】 Integration of CiNii Dissertations and CiNii Books into CiNii Research
- Trial version of CiNii Research Knowledge Graph Search feature is available on CiNii Labs
- Suspension and deletion of data provided by Nikkei BP
- Regarding the recording of “Research Data” and “Evidence Data”
Voice-activated word processor with automatic learning for dynamic optimization of syllable-templates.
-
- Togawa Furnio
- Information Systems Laboratories, Information Systems Group, Sharp Corporation
-
- Hakaridani Mitsuhiro
- Information Systems Laboratories, Information Systems Group, Sharp Corporation
-
- Iwahashi Hiroyuki
- Information Systems Laboratories, Information Systems Group, Sharp Corporation
-
- Ueda Torn
- Information Systems Laboratories, Information Systems Group, Sharp Corporation
Search this article
Description
A voice-activated word processor has been realized which provides text input by articulated, phrase-by-phrase Japanese speech, using an automatic learning algorithm to improve syllable recognition accuracy. A speaker-dependent, continuous-phrase speech recognizer incorporated into the word processor is capable of real-time recognition using 111 monosyllables as the basic recognition units. The recognizer is trained by new user to construct 590 reference syllable templates based on uttering a set of words, necessary for syllable template-matching process. Syllable templates are continually updated during use of the word processor through automatic learning. The automatic learning algorithm replaces low-accuracy syllable templates with new patterns extracted from the input speech. The replacement is sensitive to syllable context and is carried out based on recent and longer term history of the recognition accuracy of each syllable. Such learning information is derived from comparison between syllable recognition results and a user-confirmed character string of input phrase as correct, and is accumulated on a phrase-by-phrase basis. The automatic learning algorithm was tested in experiments using Japanese sentences read with pauses between phrases at approximately 4 to 5 syllables per second. The results, using eight speakers, show average syllable recognition accuracy of 82.5% with automatic learning, compared to 71.0% achieved without the learning. Further, the recognition accuracy is increased to 86.5% when the maximum number of syllable templates is increased to 2, 048; both template replacement and template addition are carried out until the maximum number of templates is reached.
Journal
-
- Journal of the Acoustical Society of Japan (E)
-
Journal of the Acoustical Society of Japan (E) 10 (3), 133-142, 1989
Acoustical Society of Japan
- Tweet
Keywords
Details 詳細情報について
-
- CRID
- 1390282680065215232
-
- NII Article ID
- 110003105863
-
- NII Book ID
- AA00256597
-
- ISSN
- 21853509
- 03882861
-
- Text Lang
- en
-
- Data Source
-
- JaLC
- Crossref
- NDL Digital Collections (NII-ELS)
- CiNii Articles
-
- Abstract License Flag
- Disallowed