Formula omitted]-law SGAN for generating spectra with more details in speech enhancement

Hongfeng Li,Yanyan Xu,Dengfeng Ke,Kaile Su

doi:10.1016/j.neunet.2020.12.017

Abstract

The goal of monaural speech enhancement is to separate clean speech from noisy speech. Recently, many studies have employed generative adversarial networks (GAN) to deal with monaural speech enhancement tasks. When using generative adversarial networks for this task, the output of the generator is a speech waveform or a spectrum, such as a magnitude spectrum, a mel-spectrum or a complex-valued spectrum. The spectra generated by current speech enhancement methods in the time–frequency domain usually lack details, such as consonants and harmonics with low energy. In this paper, we propose a new type of adversarial training framework for spectrum generation, named μ-law spectrum generative adversarial networks (μ-law SGAN). We introduce a trainable μ-law spectrum compression layer (USCL) into the proposed discriminator to compress the dynamic range of the spectrum. As a result, the compressed spectrum can display more detailed information. In addition, we use the spectrum transformed by USCL to regularize the generator’s training, so that the generator can pay more attention to the details of the spectrum. Experimental results on the open dataset Voice Bank + DEMAND show that μ-law SGAN is an effective generative adversarial architecture for speech enhancement. Moreover, visual spectrogram analysis suggests that μ-law SGAN pays more attention to the enhancement of low energy harmonics and consonants.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Formula omitted]-law SGAN for generating spectra with more details in speech enhancement

Abstract

Talk to us

Similar Papers

More From: Neural networks : the official journal of the International Neural Network Society

Lead the way for us

Journal: Neural networks : the official journal of the International Neural Network Society	Publication Date: Dec 25, 2020
Citations: 8

Similar Papers

Joint Ideal Ratio Mask and Generative Adversarial Networks for Monaural Speech Enhancement
Jing Yuan ... Changchun Bao
-
Jing Yuan, et. al.Jing Yuan ... Changchun Bao
01 Aug 2018
01 Aug 2018

Speech Enhancement Generative Adversarial Network Architecture with Gated Linear Units and Dual-Path Transformers
Dehui Zhang ... Anming Dong
-
Dehui Zhang, et. al.Dehui Zhang ... Anming Dong
09 Oct 2022
09 Oct 2022

Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition
Chris Donahue ... Rohit Prabhavalkar
-
Chris Donahue, et. al.Chris Donahue ... Rohit Prabhavalkar
01 Apr 2018
01 Apr 2018

Lightweight End-to-End Speech Enhancement Generative Adversarial Network Using Sinc Convolutions
Lujun Li ... Wudamu
Applied sciences | VOL. 11
Lujun Li, et. al.Lujun Li ... Wudamu
18 Aug 2021
Applied sciences | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Formula omitted]-law SGAN for generating spectra with more details in speech enhancement

Abstract

Talk to us

Similar Papers

More From: Neural networks : the official journal of the International Neural Network Society