Experimental evaluation of the effect of phoneme time stretching on speaker embedding
- Fukawa Taichi
  Systems Information Engineering, Graduate School of Integrative Science and Engineering, Tokyo City University
- Jin'no Kenya
  Systems Information Engineering, Graduate School of Integrative Science and Engineering, Tokyo City University
Abstract
For variable-length spectrogram sequences of phonemes, we experimentally evaluated two methods of converting each sequence to a fixed length before extracting a speaker embedding: padding and time stretching. We confirmed that both methods maintain extraction performance. We also confirmed that the choice of fixed frame length does not affect the results.
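The two length-normalization strategies named in the abstract can be illustrated with a minimal sketch. This record does not describe the authors' actual procedure, so everything here is an assumption made for illustration: the function names `pad_to_length` and `stretch_to_length` are hypothetical, a spectrogram is modeled as a plain list of frames, padding uses zeros appended at the end, over-long input is truncated, and time stretching is approximated by linear interpolation along the time axis.

```python
from typing import List

Frame = List[float]  # one spectrogram frame (one value per frequency bin)


def pad_to_length(frames: List[Frame], target_len: int,
                  pad_value: float = 0.0) -> List[Frame]:
    """Append constant-valued frames until the sequence has target_len frames.

    Assumption: sequences longer than target_len are simply truncated.
    """
    if not frames:
        raise ValueError("frames must not be empty")
    n_bins = len(frames[0])
    kept = [list(f) for f in frames[:target_len]]
    padding = [[pad_value] * n_bins for _ in range(target_len - len(kept))]
    return kept + padding


def stretch_to_length(frames: List[Frame], target_len: int) -> List[Frame]:
    """Resample the sequence to target_len frames by linear interpolation in time.

    Assumption: linear interpolation stands in for whatever time-stretching
    method the paper actually uses.
    """
    if not frames:
        raise ValueError("frames must not be empty")
    n = len(frames)
    if n == 1 or target_len == 1:
        return [list(frames[0]) for _ in range(target_len)]
    out = []
    for t in range(target_len):
        pos = t * (n - 1) / (target_len - 1)  # fractional source index
        lo = int(pos)
        hi = min(lo + 1, n - 1)
        w = pos - lo                          # interpolation weight
        out.append([(1 - w) * a + w * b
                    for a, b in zip(frames[lo], frames[hi])])
    return out


# A toy 3-frame, 2-bin "phoneme" brought to a fixed length of 5 frames.
phoneme = [[0.0, 1.0], [2.0, 3.0], [4.0, 5.0]]
padded = pad_to_length(phoneme, 5)
stretched = stretch_to_length(phoneme, 5)
```

Both functions return a sequence of exactly `target_len` frames. Padding preserves the original frames and their timing, while stretching changes the effective tempo of the phoneme so that its content spans the whole fixed-length window.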
Published in
- Nonlinear Theory and Its Applications, IEICE
Nonlinear Theory and Its Applications, IEICE 13 (2), 277-281, 2022
The Institute of Electronics, Information and Communication Engineers (IEICE)
Details
- CRID: 1390573242800278912
- ISSN: 21854106
- Text language code: en
- Data source type: JaLC, Crossref, KAKEN
- Abstract license flag: Use not permitted