Deep neural network techniques for monaural speech enhancement and separation: state of the art analysis

Peter Ochieng

doi:10.1007/s10462-023-10612-2

Abstract

AbstractDeep neural networks (DNN) techniques have become pervasive in domains such as natural language processing and computer vision. They have achieved great success in tasks such as machine translation and image generation. Due to their success, these data driven techniques have been applied in audio domain. More specifically, DNN models have been applied in speech enhancement and separation to perform speech denoising, dereverberation, speaker extraction and speaker separation. In this paper, we review the current DNN techniques being employed to achieve speech enhancement and separation. The review looks at the whole pipeline of speech enhancement and separation techniques from feature extraction, how DNN-based tools models both global and local features of speech, model training (supervised and unsupervised) to how they address label ambiguity problem. The review also covers the use of domain adaptation techniques and pre-trained models to boost speech enhancement process. By this, we hope to provide an all inclusive reference of all the state of art DNN based techniques being applied in the domain of speech separation and enhancement. We further discuss future research directions. This survey can be used by both academic researchers and industry practitioners working in speech separation and enhancement domain.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Artificial Intelligence Review	Publication Date: Oct 25, 2023
Citations: 6	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Deep neural network techniques for monaural speech enhancement and separation: state of the art analysis

Abstract

Talk to us

Similar Papers

More From: Artificial Intelligence Review

Lead the way for us

Similar Papers

A unified speaker-dependent speech separation and enhancement system based on deep neural networks
Tian Gao ... Li-Rong Dai
-
Tian Gao, et. al.Tian Gao ... Li-Rong Dai
01 Jul 2015
01 Jul 2015

ADVANCING GASIFICATION-COMBINED UP AND DOWN DRAFT GASIFIER-BASED TREATMENT OF TEXTILE WASTE: ASSESSING FEASIBILITY, ENVIRONMENTAL IMPACTS AND ENERGY RECOVERY POTENTIAL
Jayaprakash S ... Rengalakshmanan S2
INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT | VOL. 07
Jayaprakash S, et. al.Jayaprakash S ... Rengalakshmanan S2
01 Oct 2023
INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT | VOL. 07

Single-Microphone Speech Enhancement and Separation Using Deep Learning
Morten Kolbæk
-
Morten KolbækMorten Kolbæk
31 Aug 2018
31 Aug 2018

Attentive Training: A New Training Framework for Speech Enhancement.
Ashutosh Pandey ... Deliang Wang
IEEE/ACM transactions on audio, speech, and language processing | VOL. 31
Ashutosh Pandey, et. al.Ashutosh Pandey ... Deliang Wang
01 Jan 2023
IEEE/ACM transactions on audio, speech, and language processing | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep neural network techniques for monaural speech enhancement and separation: state of the art analysis

Abstract

Talk to us

Similar Papers

More From: Artificial Intelligence Review