Generative Speech Enhancement Based on Cloned Networks

Michael Chinen, Jan Skoglund, Felicia S C Lim, W Bastiaan Kleijn

doi:10.1109/waspaa.2019.8937206

Abstract

We propose to implement speech enhancement by regenerating clean speech from a salient representation extracted from the noisy signal. The network that extracts the salient features is trained using a set of weight-sharing clones of the extractor network. The clones receive as input mel-frequency spectra of different noisy versions of the same speech signal. By encouraging the outputs of the clones to be similar for these different input signals, we train a feature extractor network that is robust to noise. At inference, the salient features form the input to a WaveNet network that generates a natural and clean speech signal with the same attributes as the ground-truth clean signal. As the signal becomes noisier, our system produces natural-sounding errors that stay on the speech manifold, in place of the traditional artifacts found in other systems. Our experiments confirm that our generative enhancement system provides state-of-the-art enhancement performance within the generative class of enhancers according to a MUSHRA-like test. The clone-based system matches or outperforms the other systems in each input signal-to-noise ratio (SNR) range with statistical significance.
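The clone idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the linear extractor, the frame size, the noise model, and the names `extract` and `clone_similarity_loss` are all assumptions made for illustration. It only shows what it means for several weight-sharing clones to receive different noisy versions of the same frame and to be scored on how similar their outputs are.

```python
import random

def extract(weights, mel_frame):
    """Hypothetical linear feature extractor: one feature per weight row."""
    return [sum(w * x for w, x in zip(row, mel_frame)) for row in weights]

def clone_similarity_loss(weights, noisy_frames):
    """Mean squared deviation of each clone's features from the clone average.

    Every clone is evaluated with the same `weights`; that is the
    weight sharing. A lower value means the features vary less with the noise.
    """
    outputs = [extract(weights, frame) for frame in noisy_frames]
    n_clones = len(outputs)
    n_feats = len(outputs[0])
    mean = [sum(out[i] for out in outputs) / n_clones for i in range(n_feats)]
    total = sum((out[i] - mean[i]) ** 2
                for out in outputs for i in range(n_feats))
    return total / (n_clones * n_feats)

rng = random.Random(0)

# Stand-in for one mel-spectrum frame of clean speech (assumed 8 bands).
clean = [rng.uniform(0.0, 1.0) for _ in range(8)]

# Assumed extractor with 3 output features.
weights = [[rng.gauss(0.0, 0.5) for _ in range(8)] for _ in range(3)]

# Each of the 4 clones sees the same frame with a different noise realization.
noisy_frames = [[x + rng.gauss(0.0, 0.1) for x in clean] for _ in range(4)]

loss_noisy = clone_similarity_loss(weights, noisy_frames)
loss_identical = clone_similarity_loss(weights, [clean] * 4)
loss_zero_weights = clone_similarity_loss([[0.0] * 8 for _ in range(3)],
                                          noisy_frames)
```

Note that `loss_zero_weights` is exactly zero: a similarity term on its own is minimized by an extractor that ignores its input, so any real training setup needs a further objective that keeps the features informative. The abstract does not say which one is used, and this sketch does not attempt to supply it.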

Similar Papers

Improving generative adversarial networks for speech enhancement through regularization of latent representations
Fan Yang ... Yonghong Yan
Speech Communication | VOL. 118
06 Feb 2020

A Wavelet-Based De-Noising Speech Signal Performance with Objective Measures
S China Venkateswarlu ... Vallabhuni Vijay
14 Sep 2022

Evaluating the perceptual quality of speech signals enhanced using the Modified Phase Opponency model
Om D Deshmukh ... Carol Y Espy‐Wilson
The Journal of the Acoustical Society of America | VOL. 120
01 Nov 2006

Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech
Takuya Fujimura ... Kohei Yatabe
23 Aug 2021
