A method for estimating vocal-tract shape from a target speech spectrum

Kaburagi Tokihiko

doi:10.1250/ast.36.428

Search this article

Abstract

We present a method to simultaneously estimate the cross-sectional area and length of the vocal tract from a speech spectrum. An iterative procedure determines the vocal-tract shape by gradually optimizing the parameter values to produce the target speech spectrum. The vocal-tract shape is updated in each iteration using a sensitivity function representing the change in formant frequency caused by a slight perturbation of the vocal-tract shape. Our method effectively optimizes the vocal-tract shape when combined with the perturbation relationship between the speech spectrum parameters (i.e., cepstral parameters) and formants. The estimation accuracy is examined using area function data for 10 English vowels (Story and Titze, J. Phon., 26, 223–260, 1998). The resulting average errors are 0.36 cm² for the cross-sectional area and 0.21 cm for the vocal-tract length. This corresponds to a 17.6% and 1.24% error, respectively. The formant frequency recovered from the estimated vocal-tract shape has an error of less than 4% for each of the first four formants. We also determine that the fundamental frequency of the target speech spectrum has an influence on the estimation accuracy.

Journal

Acoustical Science and Technology

Acoustical Science and Technology 36 (5), 428-437, 2015

ACOUSTICAL SOCIETY OF JAPAN

Keywords

Details 詳細情報について

CRID: 1390282680065102592

NII Article ID: 130005097251

NII Book ID: AA11501808

DOI: 10.1250/ast.36.428

ISSN: 13475177; 03694232; 13463969

NDL BIB ID: 026703781

Web Site: https://ndlsearch.ndl.go.jp/books/R000000004-I026703781; https://www.jstage.jst.go.jp/article/ast/36/5/36_E1520/_pdf

Text Lang: en

Data Source

JaLC
NDL
Crossref
CiNii Articles

Abstract License Flag: Disallowed

Export

A method for estimating vocal-tract shape from a target speech spectrum

Search this article

Abstract

Journal

Citations (2)*help

References(22)*help

Keywords

Details 詳細情報について

Export

Report a problem

A method for estimating vocal-tract shape from a target speech spectrum

Search this article

Abstract

Journal

Citations (2)*help

References(22)*help

Keywords

Details 詳細情報について

Export

Report a problem

Project list