Noise Modeling to Build Training Sets for Robust Speech Enhancement

Yahui Wang, Yongbiao Wang, Wenxi Zhang, Zhou Wu, Hongxin Zhang, Xinxin Kong

doi:10.3390/app12041905

Abstract

DNN-based Speech Enhancement (SE) models suffer significant performance degradation on real recordings because of the mismatch between the synthetic datasets used for training and real test sets. To solve this problem, we propose a new Generative Adversarial Network framework for Noise Modeling (NM-GAN) that creates realistic paired training sets by imitating the real noise distribution. The generator of the proposed framework combines a novel 7-layer U-Net with two bidirectional long short-term memory (LSTM) layers to construct complex noise. Through adversarial and alternate training, NM-GAN attains sufficient recall (diversity) and precision (noise quality) in its samples to simulate real noise effectively; the generated noise is then used to compose realistic paired training sets. Extensive experiments with various qualitative and quantitative evaluation metrics verify the effectiveness of the generated noise samples and training sets, demonstrating the framework's capabilities.
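The abstract does not say how noise samples and clean speech are combined into pairs. As a minimal sketch, assuming plain additive mixing at a chosen signal-to-noise ratio (a common convention, not necessarily the authors' procedure), one (noisy, clean) training pair could be composed as below. The function name `mix_at_snr` and its parameters are hypothetical, and the noise array simply stands in for a sample drawn from a noise model.

```python
import numpy as np


def mix_at_snr(clean, noise, snr_db):
    """Compose one (noisy, clean) pair by additive mixing at snr_db.

    Assumption: additive mixing at a target SNR; the source does not
    specify the mixing procedure.
    """
    clean = np.asarray(clean, dtype=np.float64)
    noise = np.asarray(noise, dtype=np.float64)

    # Repeat the noise if it is shorter than the speech, then crop to length.
    if len(noise) < len(clean):
        reps = int(np.ceil(len(clean) / len(noise)))
        noise = np.tile(noise, reps)
    noise = noise[: len(clean)]

    clean_power = np.mean(clean ** 2)
    noise_power = np.mean(noise ** 2)
    if clean_power == 0.0 or noise_power == 0.0:
        raise ValueError("clean and noise signals must have non-zero power")

    # Scale the noise so that clean_power / scaled_noise_power matches the target SNR.
    target_noise_power = clean_power / (10.0 ** (snr_db / 10.0))
    scale = np.sqrt(target_noise_power / noise_power)

    noisy = clean + scale * noise
    return noisy, clean


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    clean = rng.standard_normal(16000)   # stand-in for one second of speech
    noise = rng.standard_normal(4000)    # stand-in for a generated noise sample
    noisy, target = mix_at_snr(clean, noise, snr_db=5.0)
    print(noisy.shape, target.shape)
```

Sweeping `snr_db` over a range and drawing a different noise sample for each utterance is one simple way to vary the resulting training pairs.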

Highlights

Academic Editors: Andrea Prati. Published: 11 February 2022.
Speech enhancement [1] (SE) is the extraction of speech signals while suppressing sources of interference and eliminating noise.
Given the importance of realistic datasets, this paper focuses on developing a Generative Adversarial Network (GAN) that effectively models noise and creates synthetic but highly credible training sets.

Summary

Introduction

Speech enhancement [1] (SE) is the extraction of speech signals while suppressing sources of interference and eliminating noise. SE plays an important role in improving the intelligibility and quality of noisy speech recordings. Deep Neural Network (DNN)-based SE methods have received significant attention as part of a broader interest in learning-related Artificial Intelligence (AI). Recurrent Neural Networks (RNNs) [2,3] and Generative Adversarial Nets (GANs) [4,5], along with other DNN-based architectures [6,7,8], have already been explored in SE tasks.



Journal: Applied Sciences
Publication Date: Feb 11, 2022
Citations: 1
License type: CC BY 4.0


Similar Papers

A Joint Long Short-Term Memory and AdaBoost regression approach with application to remaining useful life estimation
Xiaoyan Zhu ... Min Xie
Measurement | VOL. 170
12 Nov 2020

An Enhanced Deep Learning Approach in Forecasting Banana Harvest Yields
Mariannie A Rebortera ... Arnel C
International Journal of Advanced Computer Science and Applications | VOL. 10
01 Jan 2019

Enhancing Large-Scale Hydro-Climate Services Through a Regionalized Machine Learning Approach
Yiheng Du ... Ilias G Pechlivanidis
08 Mar 2024

Generation of quantification maps and weighted images from synthetic magnetic resonance imaging using deep learning network
Yawen Liu ... Wenjuan Liu
Physics in Medicine & Biology | VOL. 67
17 Jan 2022
