Experimental evaluation of the effect of phoneme time stretching on speaker embedding

DOI Web Site 参考文献5件 オープンアクセス
  • Fukawa Taichi
    Systems Information Engineering, Graduate School of Integrative Science and Engineering, Tokyo City University
  • Jin'no Kenya
    Systems Information Engineering, Graduate School of Integrative Science and Engineering, Tokyo City University

抄録

<p>For an indefinite length spectrogram sequence of phonemes, we experimentally verified two methods of obtaining speaker embedding by transforming it to fixed length: adding padding and time stretching. We confirmed that both methods can maintain the extraction performance. We also confirm that the fixed frame length does not affect the results.</p>

収録刊行物

参考文献 (5)*注記

もっと見る

関連プロジェクト

もっと見る

詳細情報 詳細情報について

問題の指摘

ページトップへ