Supervised single-channel speech dereverberation and denoising using a two-stage model based sparse representation

Long Zhang,Xu Xu,Huang Chen,Jiaxu Chen,Zhongfu Ye

doi:10.1016/j.specom.2017.12.012

Abstract

In many acoustic conditions, a single-channel recorded speech signal may be severely affected by reverberation and noise, leading to a reduced speech quality and intelligibility. This paper focuses on proposing a novel two-stage model scheme by decomposing room impulse responses (RIRs) into two convolution parts for single-channel speech dereverberation and denoising. Similar as previous methods, the proposed two-stage model uses non-negative approximations of the convolutive transfer function (NCTF) to simultaneously estimate the magnitude spectrograms of the speech and the RIR. It focuses on iteratively updating model parameters to estimate a less reverberant speech signal and a short RIR at first stage, then the clean speech signal and the other short RIR are estimated by iteratively renewing at the second stage. There are always denosing processing steps existing in both stages to denoise more thoroughly. A straightforward method based on the scheme is built to enhance the speech from the noisy reverberant signal, then two fusion methods inspired by ensemble learning are proposed for speech enhancement. The advantages of our proposed methods are more capable to enhance the speech and more time-saving through decomposing the long RIRs into two shorter ones. Additionally, the optimal estimator is derived based on temporal stacking to utilize speech temporal dynamics. Experiments are performed on two simulated RIRs and a real RIR to compare the performances of the proposed methods with a state-of-the-art method and the results show that the proposed methods have achieved either better or comparable performances in most measures but phone error rate.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Supervised single-channel speech dereverberation and denoising using a two-stage model based sparse representation

Abstract

Talk to us

Similar Papers

More From: Speech Communication

Lead the way for us

Journal: Speech Communication	Publication Date: Dec 27, 2017
Citations: 3

Similar Papers

Supervised single-channel speech dereverberation and denoising using a two-stage processing
Long Zhang ... Jiafei Fu
-
Long Zhang, et. al.Long Zhang ... Jiafei Fu
01 Jun 2017
01 Jun 2017

Recurrent Neural Networks for Cochannel Speech Separation in Reverberant Environments
Masood Delfarah ... Deliang Wang
-
Masood Delfarah, et. al.Masood Delfarah ... Deliang Wang
01 Apr 2018
01 Apr 2018

Statistical models for speech dereverberation
Takuya Yoshioka ... Hirokazu Kameoka
-
Takuya Yoshioka, et. al.Takuya Yoshioka ... Hirokazu Kameoka
01 Oct 2009
01 Oct 2009

A Single-Channel Non-Intrusive C50 Estimator Correlated With Speech Recognition Performance
Pablo Peso Parada ... Daniel Barreda
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 24
Pablo Peso Parada, et. al.Pablo Peso Parada ... Daniel Barreda
01 Apr 2016
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 24

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Supervised single-channel speech dereverberation and denoising using a two-stage model based sparse representation

Abstract

Talk to us

Similar Papers

More From: Speech Communication