Constructing text-to-speech systems for languages with unknown pronunciations

Sawada Kei, Hashimoto Kei, Oura Keiichiro, Nankaku Yoshihiko, Tokuda Keiichi

doi:10.1250/ast.39.119

Abstract

This paper proposes a method for constructing text-to-speech (TTS) systems for languages with unknown pronunciations. One goal of speech synthesis research is to establish a framework that can be used to construct TTS systems for any written language. Generally, language-specific knowledge is required to construct TTS systems for a new language. However, it is difficult to acquire language-specific knowledge for each new language. Therefore, constructing a TTS system for a new language entails huge costs. To address this problem, we investigate a framework for automatically constructing a TTS system from a target language database consisting of only speech data and corresponding Unicode texts. In the proposed method, pseudo phonetic information of the target language with unknown pronunciation is obtained by a speech recognizer of a rich-resource proxy language. Then, a grapheme-to-phoneme converter and a statistical parametric speech synthesizer are constructed based on the obtained pseudo phonetic information. The proposed method was applied to Japanese and was evaluated in terms of objective and subjective measures. Additionally, we attempted to construct TTS systems for nine Indian languages using the proposed method, and these systems were evaluated in the Blizzard Challenge 2014 and 2015.
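The data flow described in the abstract (proxy-language recognizer output used as pseudo phonetic labels, then a grapheme-to-phoneme converter trained on those labels) can be illustrated with a toy sketch. This is not the authors' implementation: the pseudo-phone labels below are hand-written stand-ins for recognizer output, the one-grapheme-to-one-label alignment is a simplifying assumption, and the names `train_g2p` and `apply_g2p` are hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical training data: target-language text paired with pseudo-phone
# labels. In the setting the abstract describes, the labels would come from
# a speech recognizer of a rich-resource proxy language; here they are
# hand-written placeholders.
pairs = [
    ("さくら", ["s_a", "k_u", "r_a"]),
    ("さら", ["s_a", "l_a"]),
    ("くら", ["k_u", "r_a"]),
]


def train_g2p(pairs):
    """Count grapheme/label co-occurrences under a naive 1:1 alignment
    and keep the most frequent label for each grapheme."""
    counts = defaultdict(Counter)
    for graphemes, phones in pairs:
        if len(graphemes) != len(phones):
            continue  # the toy aligner cannot handle unequal lengths
        for g, p in zip(graphemes, phones):
            counts[g][p] += 1
    return {g: c.most_common(1)[0][0] for g, c in counts.items()}


def apply_g2p(model, text, unknown="<unk>"):
    """Map each grapheme of unseen text to its learned pseudo-phone label."""
    return [model.get(ch, unknown) for ch in text]


model = train_g2p(pairs)
print(apply_g2p(model, "らく"))
```

A real system would need a proper alignment model between graphemes and recognized phone sequences, and the resulting labels would then feed a statistical parametric speech synthesizer, which this sketch leaves out.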

Journal

Acoustical Science and Technology

Acoustical Science and Technology 39 (2), 119-129, 2018

The Acoustical Society of Japan

Details

CRID: 1390001205088986240

NII Article ID: 130006407740

NII Bibliographic ID: AA11501808

DOI: 10.1250/ast.39.119

ISSN: 13475177; 03694232; 13463969

NDL Bibliographic ID: 028863144

Web Site: https://ndlsearch.ndl.go.jp/books/R000000004-I028863144; https://www.jstage.jst.go.jp/article/ast/39/2/39_E1734/_pdf

Text language code: en

Data source type

JaLC
NDL
Crossref
CiNii Articles

Abstract license flag: Not permitted for use
