Study for Prosodic Control Command Generation of Synthetic Speech

Bibliographic Information

Other Title
  • 音声合成のための韻律制御コマンド作成方法の検討
  • オンセイ ゴウセイ ノ タメ ノ インリツ セイギョ コマンド サクセイ ホウホウ ノ ケントウ

Abstract

The Multi-layered Speech/Sound Synthesis Control Language (MSCL) proposed herein facilitates the synthesis of several speech modes, such as nuance, mental state, and emotion, and allows speech to be synchronized easily with other media. MSCL is a multi-layered linguistic system comprising three layers: the semantic level layer (the S-layer), the interpretation level layer (the I-layer), and the parameter level layer (the P-layer). This multi-level description system is convenient for both lay and professional users. Furthermore, mental state tendencies were investigated using a test that examined how subjects perceive the control of synthetic speech prosody. The results showed relationships between prosodic control rules and non-verbal expressions. These relationships are useful for constructing semantic prosody control. This paper describes these functions and the effective prosodic feature controls possible with MSCL.
