Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition

Chris Donahue,Rohit Prabhavalkar,Bo Li

doi:10.1109/icassp.2018.8462581

Abstract

We investigate the effectiveness of generative adversarial networks (GANs) for speech enhancement, in the context of improving noise robustness of automatic speech recognition (ASR) systems. Prior work [1] demonstrates that GANs can effectively suppress additive noise in raw waveform speech signals, improving perceptual quality metrics; however this technique was not justified in the context of ASR. In this work, we conduct a detailed study to measure the effectiveness of GANs in enhancing speech contaminated by both additive and reverberant noise. Motivated by recent advances in image processing [2], we propose operating GANs on log-Mel filterbank spectra instead of waveforms, which requires less computation and is more robust to reverberant noise. While GAN enhancement improves the performance of a clean-trained ASR system on noisy speech, it falls short of the performance achieved by conventional multi-style training (MTR). By appending the GAN-enhanced features to the noisy inputs and retraining, we achieve a 7% WER improvement relative to the MTR system.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Mutual-optimization Towards Generative Adversarial Networks For Robust Speech Recognition
Ke Ding ... Kaile Su
-
Ke Ding, et. al.Ke Ding ... Kaile Su
01 Aug 2018
01 Aug 2018

Data augmentation using generative adversarial networks for robust speech recognition
Yanmin Qian ... Tian Tan
Speech Communication | VOL. 114
Yanmin Qian, et. al.Yanmin Qian ... Tian Tan
19 Aug 2019
Speech Communication | VOL. 114

Improved Speech Enhancement Using a Time-Domain GAN with Mask Learning
Ju Lin ... Jerome L Mcclendon
-
Ju Lin, et. al.Ju Lin ... Jerome L Mcclendon
25 Oct 2020
25 Oct 2020

Adversarial Training with Gated Convolutional Neural Networks for Robust Speech Recognition
Xudong Lv ... Xin Wang
-
Xudong Lv, et. al.Xudong Lv ... Xin Wang
01 Nov 2021
01 Nov 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition

Abstract

Talk to us

Similar Papers