Voice Activity Detection Using Density Ratio Estimation of Speech and Noise
- Tachioka Yuuki
  - Information Technology R&D Center, Mitsubishi Electric Corporation
- Hanazawa Toshiyuki
  - Information Technology R&D Center, Mitsubishi Electric Corporation
- Narita Tomohiro
  - Information Technology R&D Center, Mitsubishi Electric Corporation
- Ishii Jun
  - Information Technology R&D Center, Mitsubishi Electric Corporation
Bibliographic Information
- Other Title: 音声と騒音の密度比推定を用いた音声区間検出法 (オンセイ ト ソウオン ノ ミツドヒ スイテイ オ モチイタ オンセイ クカン ケンシュツホウ)
Abstract
In this paper, we propose a robust voice activity detection (VAD) method that uses a density ratio model. The likelihood ratio test (LRT) is effective for VAD in highly noisy environments. A conventional LRT constructs speech and noise models, calculates the likelihood under each model, and takes the ratio of those likelihoods to detect speech. Although several improved LRTs have been proposed, conventional LRTs do not take into account that what is required is the likelihood ratio of the speech and noise models, not the likelihood of each model. The proposed method estimates the likelihood ratio directly, without calculating each likelihood, using a density ratio model obtained in advance by a density ratio estimation procedure. A further problem is the choice of thresholds, which are used for VAD and significantly affect its performance. We propose a method that determines the thresholds automatically using discriminant analysis. Experiments show that the proposed method is more effective than conventional methods, especially in non-stationary noisy environments.
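The two ideas in the abstract, estimating the speech-to-noise density ratio directly and choosing the decision threshold by discriminant analysis, can be illustrated with a minimal sketch. The abstract does not say which density ratio estimator or which form of discriminant analysis the authors use, so the code below is an assumption throughout: it substitutes a least-squares kernel fit (in the style of uLSIF) for the ratio and an Otsu-style between-class variance criterion for the threshold. All function names and parameters (`fit_density_ratio`, `discriminant_threshold`, `sigma`, `lam`, `n_centers`, `n_bins`) are hypothetical.

```python
import numpy as np


def gaussian_kernel(x, centers, sigma):
    """Gaussian kernel matrix between rows of x (n, d) and centers (b, d)."""
    d2 = ((x[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-d2 / (2.0 * sigma ** 2))


def fit_density_ratio(speech, noise, sigma=1.0, lam=0.1, n_centers=50, seed=0):
    """Fit r(x) ~ p_speech(x) / p_noise(x) without estimating either density.

    Assumption: a least-squares kernel model (uLSIF-style), not necessarily
    the estimator used in the paper.
    """
    rng = np.random.default_rng(seed)
    n_centers = min(n_centers, len(speech))
    centers = speech[rng.choice(len(speech), size=n_centers, replace=False)]
    phi_s = gaussian_kernel(speech, centers, sigma)  # numerator samples
    phi_n = gaussian_kernel(noise, centers, sigma)   # denominator samples
    H = phi_n.T @ phi_n / len(noise)
    h = phi_s.mean(axis=0)
    alpha = np.linalg.solve(H + lam * np.eye(n_centers), h)
    alpha = np.maximum(alpha, 0.0)  # a density ratio cannot be negative
    return lambda x: gaussian_kernel(x, centers, sigma) @ alpha


def discriminant_threshold(scores, n_bins=64):
    """Pick the split of 1-D scores that maximizes between-class variance.

    Assumption: an Otsu-style criterion stands in for the paper's
    discriminant analysis.
    """
    hist, edges = np.histogram(scores, bins=n_bins)
    p = hist / hist.sum()
    mids = 0.5 * (edges[:-1] + edges[1:])
    w0 = np.cumsum(p)            # weight of the lower class for each split
    m = np.cumsum(p * mids)      # cumulative first moment
    w1 = 1.0 - w0
    valid = (w0 > 0) & (w1 > 0)
    between = np.zeros(n_bins)
    between[valid] = (m[-1] * w0[valid] - m[valid]) ** 2 / (w0[valid] * w1[valid])
    return edges[int(np.argmax(between)) + 1]
```

In a VAD setting, one would fit the ratio on labeled speech and noise feature frames in advance, score each incoming frame with the fitted function, and mark a frame as speech when its score exceeds the threshold computed from the scores of the utterance. The features, kernel width, and regularization would all need tuning for real audio.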
Journal
- IEEJ Transactions on Electronics, Information and Systems 133 (8), 1549-1555, 2013
- The Institute of Electrical Engineers of Japan
Details
- CRID: 1390001204608787584
- NII Article ID: 10031189047
- NII Book ID: AN10065950
- ISSN: 1348-8155, 0385-4221
- NDL BIB ID: 024846801
- Text Lang: ja
- Data Source: JaLC, NDL, Crossref, CiNii Articles
- Abstract License Flag: Disallowed