Data Augmentation Strategies for Neural Network F0 Estimation

Manu Airaksinen,Lauri Juvela,Okko Rasanen,Paavo Alku

doi:10.1109/icassp.2019.8683041

Abstract

This study explores various speech data augmentation methods for the task of noise-robust fundamental frequency (F0) estimation with neural networks. The explored augmentation strategies are split into additive noise and channel -based augmentation and into vocoder-based augmentation methods. In vocoder-based augmentation, a glottal vocoder is used to enhance the accuracy of ground truth F0 used for training of the neural network, as well as to expand the training data diversity in terms of F0 patterns and vocal tract lengths of the talkers. Evaluations on the PTDB-TUG corpus indicate that noise and channel augmentation can be used to greatly increase the noise robustness of trained models, and that vocoder-based ground truth enhancement further increases model performance. For smaller datasets, vocoder-based diversity augmentation can also be used to increase performance. The best-performing proposed method greatly outperformed the compared F0 estimation methods in terms of noise robustness.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Data Augmentation Strategies for Neural Network F0 Estimation

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Noise robust Zernike phase retrieval via learning based algorithm only with 2-step phase shift measurements.
Hansol Kim ... Yoonchan Jeong
Optics Express | VOL. 31
Hansol Kim, et. al.Hansol Kim ... Yoonchan Jeong
29 Aug 2023
Optics Express | VOL. 31

Comparison of dual rail and an enhanced bundled data asynchronous protocols noise robustness in the GALS NoC link application
Soodeh Aghli Moghaddam ... Parviz Jabedar Maralani
-
Soodeh Aghli Moghaddam, et. al.Soodeh Aghli Moghaddam ... Parviz Jabedar Maralani
01 Oct 2009
01 Oct 2009

Evaluation of Optimization Algorithms and Noise Robustness of Sparsity-Promoting Dynamic Mode Decomposition
Yuto Iwasaki ... Takayuki Nagata
IEEE Access | VOL. 10
Yuto Iwasaki, et. al.Yuto Iwasaki ... Takayuki Nagata
01 Jan 2021
IEEE Access | VOL. 10

Instantaneous Frequency Estimation of FM Signals under Gaussian and Symmetric α-Stable Noise: Deep Learning versus Time–Frequency Analysis
Huda Saleem Razzaq ... Zahir M Hussain
Information | VOL. 14
Huda Saleem Razzaq, et. al.Huda Saleem Razzaq ... Zahir M Hussain
28 Dec 2022
Information | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Data Augmentation Strategies for Neural Network F0 Estimation

Abstract

Talk to us

Similar Papers