Approach of features with confident weight for robust speech recognition

Lingnan Ge, Shirai Katsuhiko, Kurematsu Akira

doi:10.1250/ast.32.92

この論文をさがす

説明

The enhancement of speech has become one of the focuses of automatic speech recognition (ASR) development. In recent studies, the missing feature approach (MFA) has been proved to be a suitable method. However the hard mask decision in the MFA is mostly a rough binary classifier on the basis of a certain threshold value that could cause a failed decision of reliability and result in a signal screening risk. As improvements of the hard mask the effectiveness of soft masks, including soft mask works with a Bayesian classifier, attempt to compensate the loss of real speech in the hard mask decision by discovering the probability density function (p.d.f.) of the unreliable feature component. Unfortunately, this is a very difficult task because of the overlap of at least two complex random processes. The sigmoid function suggested by some soft masks is not a reasonable p.d.f. In this paper, we provide an analysis of the confident degree of a feature component in a subband based on four criteria and then propose four types of confident weight (CWs). Based on CWs, we introduce four classes of approaches of feature with confident weight (AFCWs), which estimate the confidence degree of each feature vector simply and efficiently, describe the effect of noise in a rigorous manner, and eliminate the risk of selecting thresholds and the difficulty of finding a joint p.d.f. of reliable and unreliable components. Experimental results have shown that the proposed approaches improve the performances of ASR systems even in an adverse environment.

収録刊行物

Acoustical Science and Technology

Acoustical Science and Technology 32 (3), 92-99, 2011

一般社団法人日本音響学会

キーワード

詳細情報詳細情報について

CRID: 1390001205090687360

NII論文ID: 130000727459

NII書誌ID: AA11501808

DOI: 10.1250/ast.32.92

ISSN: 13475177; 03694232; 13463969

NDL書誌ID: 11060138

Web Site: http://id.ndl.go.jp/bib/11060138; https://ndlsearch.ndl.go.jp/books/R000000004-I11060138; http://www.jstage.jst.go.jp/article/ast/32/3/32_3_92/_pdf

本文言語コード: en

データソース種別

JaLC
NDLサーチ
Crossref
CiNii Articles
OpenAIRE

抄録ライセンスフラグ: 使用不可

書き出し

問題の指摘