Data Augmentation Using Spectral Structure for Supervised Monaural Source Separation of Frog Choruses

Bibliographic Information

Other Title
  • カエルの合唱音声に対する教師ありモノラル音源分離のためのスペクトル構造を用いたデータ拡張

Abstract

<p>Sound source separation, which extracts individual sounds from a mixture, is necessary for analyzing interactions between individuals in a frog chorus. Supervised monaural source separation is promising for frogs because they gather in dense groups and their positions relative to the microphone are fixed during a chorus but unknown before it. Training the separation model requires a large amount of sound data, but such data are difficult to collect because many frogs must be captured and their choruses recorded. We propose data augmentation that focuses on the spectral structure of frog calls. Based on an analysis of this structure, we modulate and stretch calls to increase the variety of call patterns in the training data. We conduct a sound source separation experiment for two frogs using the augmented data and confirm the effectiveness of the data augmentation in terms of the signal-to-distortion ratio.</p>
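The abstract does not give implementation details, so the following is only a minimal sketch of the kind of pipeline it describes: stretch a call, mix two calls into a monaural signal, and score an estimate with a basic signal-to-distortion ratio. All function names (`stretch`, `mix`, `sdr_db`) are hypothetical. Two choices are assumptions rather than the authors' method: the stretch is plain linear-interpolation resampling, which shifts pitch along with duration, and the SDR is the simple energy-ratio form 10 log10(||s||^2 / ||s - s_hat||^2).

```python
import math


def stretch(signal, rate):
    """Resample by linear interpolation (assumed stand-in for call stretching).

    rate > 1 shortens the call, rate < 1 lengthens it. Plain resampling
    also scales the pitch by the same factor.
    """
    if rate <= 0:
        raise ValueError("rate must be positive")
    if not signal:
        return []
    n_out = max(1, int(round(len(signal) / rate)))
    out = []
    for i in range(n_out):
        pos = i * rate
        lo = int(pos)
        if lo >= len(signal) - 1:
            # Past the last interpolation interval: hold the final sample.
            out.append(signal[-1])
            continue
        frac = pos - lo
        out.append((1.0 - frac) * signal[lo] + frac * signal[lo + 1])
    return out


def mix(a, b):
    """Sum two calls sample by sample, truncated to the shorter one."""
    n = min(len(a), len(b))
    return [a[i] + b[i] for i in range(n)]


def sdr_db(reference, estimate):
    """Basic SDR in dB: 10 * log10(||s||^2 / ||s - s_hat||^2)."""
    if len(reference) != len(estimate):
        raise ValueError("signals must have the same length")
    signal_energy = sum(s * s for s in reference)
    if signal_energy == 0.0:
        raise ValueError("reference signal has zero energy")
    error_energy = sum((s - e) ** 2 for s, e in zip(reference, estimate))
    if error_energy == 0.0:
        return math.inf
    return 10.0 * math.log10(signal_energy / error_energy)


if __name__ == "__main__":
    # Two synthetic "calls" (pure tones) standing in for real recordings.
    sr = 8000
    call_a = [math.sin(2 * math.pi * 900 * t / sr) for t in range(sr // 10)]
    call_b = [math.sin(2 * math.pi * 1400 * t / sr) for t in range(sr // 10)]

    # Augment one call, then build a two-source training mixture.
    call_a_aug = stretch(call_a, 0.9)
    mixture = mix(call_a_aug, call_b)

    # Score the unprocessed mixture as a (poor) estimate of call_b.
    print(sdr_db(call_b[:len(mixture)], mixture))
```

In a real setup, the augmented calls would be mixed into many training pairs, and the SDR would be computed between each true source and the output of the trained separation model rather than the raw mixture.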
