A New Ratio Mask Representation for CASA-Based Speech Enhancement

Feng Bao,Waleed H Abdulla

doi:10.1109/taslp.2018.2868407

Abstract

In the computational auditory scene analysis method, the ideal ratio mask or alternatively the ideal binary mask is the key point to reconstruct the enhanced signal. The ratio mask in its Wiener filtering or its square root form is currently considered. However, this kind of ratio mask overlooked one important issue. It does not exploit the inter-channel correlation (ICC) in the noisy speech, noise, and clean speech spectra. Thus, in this paper, we first propose a novel ratio mask representation by utilizing the ICC. In this way, we adaptively reallocate the power ratio of the speech and noise during the construction of ratio mask, thus, more speech and noise components are retained and masked at the same time, respectively. Second, the channel-weight contour based on the equal loudness hearing attribute is adopted to revise this new ratio mask in each Gammatone filterbank channel. Finally, the revised ratio mask is effectively used to train a five-layer structured deep neural network. Experiments show that the proposed ratio mask performs better than the conventional ratio mask representation and other series of enhancement algorithms in terms of speech quality, intelligibility, and spectral distortion under different signal to noise ratio conditions using six types of noises.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A New Ratio Mask Representation for CASA-Based Speech Enhancement

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing

Lead the way for us

Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing	Publication Date: Jan 1, 2019
Citations: 60

Similar Papers

Speech enhancement based on simple recurrent unit network
Xingyue Cui ... Fuliang Yin
Applied Acoustics | VOL. 157
Xingyue Cui, et. al.Xingyue Cui ... Fuliang Yin
16 Sep 2019
Applied Acoustics | VOL. 157

Unseen Noise Estimation Using Separable Deep Auto Encoder for Speech Enhancement
Meng Sun ... Hugo Van Hamme
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 24
Meng Sun, et. al.Meng Sun ... Hugo Van Hamme
01 Jan 2015
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 24

Bayesian Multichannel Speech Enhancement with a Deep Speech Prior
Kouhei Sekiguchi ... Tatsuya Kawahara
-
Kouhei Sekiguchi, et. al.Kouhei Sekiguchi ... Tatsuya Kawahara
01 Nov 2018
01 Nov 2018

A Fully Convolutional Neural Network for Speech Enhancement
Se Rim Park ... Jin Won Lee
-
Se Rim Park, et. al.Se Rim Park ... Jin Won Lee
20 Aug 2017
20 Aug 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A New Ratio Mask Representation for CASA-Based Speech Enhancement

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing