JSUT and JVS: Free Japanese voice corpora for accelerating speech synthesis research

Takamichi Shinnosuke, Sonobe Ryosuke, Mitsui Kentaro, Saito Yuki, Koriyama Tomoki, Tanji Naoko, Saruwatari Hiroshi

doi:10.1250/ast.41.761

JSUT and JVS: Free Japanese voice corpora for accelerating speech synthesis research

DOI Web Site 被引用文献7件参考文献17件オープンアクセス

Takamichi Shinnosuke

Graduate School of Information Science and Technology, The University of Tokyo
Sonobe Ryosuke

Graduate School of Information Science and Technology, The University of Tokyo
Mitsui Kentaro

Graduate School of Information Science and Technology, The University of Tokyo
Saito Yuki

Graduate School of Information Science and Technology, The University of Tokyo
Koriyama Tomoki

Graduate School of Information Science and Technology, The University of Tokyo
Tanji Naoko

Graduate School of Information Science and Technology, The University of Tokyo
Saruwatari Hiroshi

Graduate School of Information Science and Technology, The University of Tokyo

この論文をさがす

CiNii Books

説明

<p>In this paper, we develop two corpora for speech synthesis research. Thanks to improvements in machine learning techniques, including deep learning, speech synthesis is becoming a machine learning task. To accelerate speech synthesis research, we aim at developing Japanese voice corpora reasonably accessible from not only academic institutions but also commercial companies. In this paper, we construct the JSUT and JVS corpora. They are designed mainly for text-to-speech synthesis and voice conversion, respectively. The JSUT corpus contains 10 hours of reading-style speech uttered by a single speaker, and the JVS corpus contains 30 hours containing three styles of speech uttered by 100 speakers. This paper describes how we designed the corpora and summarizes the specifications. The corpora are available at our project pages.</p>

収録刊行物

Acoustical Science and Technology

Acoustical Science and Technology 41 (5), 761-768, 2020-09-01

一般社団法人日本音響学会

被引用文献 (7)*注記

参考文献 (17)*注記

詳細情報詳細情報について

CRID

1390566775163782016
NII論文ID

130007895044
DOI

10.1250/ast.41.761
ISSN

13475177

03694232

13463969
Web Site

https://www.jstage.jst.go.jp/article/ast/41/5/41_E1950/_pdf
本文言語コード

en
データソース種別
- JaLC
- Crossref
- CiNii Articles
- OpenAIRE
抄録ライセンスフラグ
使用不可

書き出し

問題の指摘

ページトップへ

JSUT and JVS: Free Japanese voice corpora for accelerating speech synthesis research

この論文をさがす

説明

収録刊行物

被引用文献 (7)*注記

参考文献 (17)*注記

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

詳細情報詳細情報について