Melodic Emotional Expression Increases Ease of Talking to Spoken Dialog Agents

Description

Many people, especially in Japan, feel embarrassed conversing with agents such as virtual assistants, probably because of their low social presence, i.e., the degree to which one perceives the human-like properties of an agent. We assumed that the poor emotional expressiveness of agents may impair their human-likeness. In this study, we propose melodic emotional expression (MEE), a new auditory emotional expression for spoken dialog agents. We added background music (BGM) and sound effects as MEE to synthetic voices and conducted experiments to investigate their effects. First, we found that adding MEE to a flat synthetic voice could convey emotions as intended. We also found that when positive emotions were expressed by MEE, the agent became more human-like and easier to talk to. Furthermore, these effects persisted when MEE was added to an emotional synthetic voice, and they were particularly noticeable with BGM. We also attempted automatic BGM generation, which is necessary for the practical application of MEE. Listeners accurately categorized the BGM generated by our prototype system into four emotion types: joy, anger, sadness, and relaxation.
