Supervised Determined Source Separation with Multichannel Variational Autoencoder

Hirokazu Kameoka, Li Li, Shota Inoue, Shoji Makino

doi:10.1162/neco_a_01217

Open Access. 10 citations, 27 references.

Hirokazu Kameoka
Nippon Telegraph and Telephone Corporation, Kanagawa, 243-0198, Japan

Li Li
University of Tsukuba, Ibaraki, 305-8577, Japan

Shota Inoue
University of Tsukuba, Ibaraki, 305-8577, Japan

Shoji Makino
University of Tsukuba, Ibaraki, 305-8577, Japan

Abstract

This letter proposes a multichannel source separation technique, the multichannel variational autoencoder (MVAE) method, which uses a conditional VAE (CVAE) to model and estimate the power spectrograms of the sources in a mixture. By training the CVAE using the spectrograms of training examples with source-class labels, we can use the trained decoder distribution as a universal generative model capable of generating spectrograms conditioned on a specified class index. By treating the latent space variables and the class index as the unknown parameters of this generative model, we can develop a convergence-guaranteed algorithm for supervised determined source separation that consists of iteratively estimating the power spectrograms of the underlying sources, as well as the separation matrices. In experimental evaluations, our MVAE produced better separation performance than a baseline method.
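
The abstract mentions two sets of quantities that are estimated in alternation: the source power spectrograms and the separation matrices. As a rough illustration only, the sketch below shows how such quantities are commonly tied together in determined separation, assuming a local Gaussian source model (an assumption here, since the abstract does not state the objective). The function names, array shapes, and the objective itself are hypothetical choices for illustration, not taken from the paper, and the CVAE decoder that would supply the power spectrograms is not modeled.

```python
import numpy as np

def separate(W, X):
    """Apply one separation matrix per frequency bin.

    W: (F, I, I) complex separation matrices (hypothetical layout).
    X: (F, N, I) complex mixture STFT (F bins, N frames, I channels).
    Returns Y: (F, N, I) separated signals, Y[f, n] = W[f] @ X[f, n].
    """
    return np.einsum("fji,fni->fnj", W, X)

def neg_log_likelihood(W, X, V):
    """Negative log-likelihood (up to constants) under an assumed
    local Gaussian model: each separated bin Y[f, n, j] is zero-mean
    complex Gaussian with variance V[f, n, j].

    V: (F, N, I) positive source power spectrogram estimates. In the
    method described above these would come from a trained decoder;
    here they are simply given as an array.
    """
    Y = separate(W, X)
    n_frames = X.shape[1]
    # slogdet works on the stack of matrices; logabsdet has shape (F,)
    _, logabsdet = np.linalg.slogdet(W)
    data_term = np.sum(np.log(V) + np.abs(Y) ** 2 / V)
    return float(data_term - 2.0 * n_frames * np.sum(logabsdet))

# Tiny usage example with identity separation matrices.
F, N, I = 3, 5, 2
W = np.tile(np.eye(I, dtype=complex), (F, 1, 1))
X = np.ones((F, N, I), dtype=complex)
V = np.ones((F, N, I))
nll = neg_log_likelihood(W, X, V)
```

An iterative scheme of the kind the abstract describes would alternate between lowering such an objective with respect to W (holding V fixed) and with respect to the parameters that generate V (holding W fixed).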

Journal

Neural Computation

Neural Computation 31 (9), 1891-1914, 2019-09

MIT Press - Journals

Details

CRID
1361975846308410496

DOI
10.1162/neco_a_01217

ISSN
1530888X
08997667

Web Site
https://www.mitpressjournals.org/doi/pdf/10.1162/neco_a_01217

Data Source
- Crossref
- KAKEN
