Bayesian STSA estimation using masking properties and generalized Gamma prior for speech enhancement

Mahdi Parchami,Wei-Ping Zhu,Eric Plourde,Benoit Champagne

doi:10.1186/s13634-015-0270-6

Mahdi Parchami, Wei-Ping Zhu + Show 2 more

Open Access

https://doi.org/10.1186/s13634-015-0270-6

Copy DOI

Abstract

We consider the estimation of the speech short-time spectral amplitude (STSA) using a parametric Bayesian cost function and speech prior distribution. First, new schemes are proposed for the estimation of the cost function parameters, using an initial estimate of the speech STSA along with the noise masking feature of the human auditory system. This information is further employed to derive a new technique for the gain flooring of the STSA estimator. Next, to achieve better compliance with the noisy speech in the estimator’s gain function, we take advantage of the generalized Gamma distribution in order to model the STSA prior and propose an SNR-based scheme for the estimation of its corresponding parameters. It is shown that in Bayesian STSA estimators, the exploitation of a rough STSA estimate in the parameter selection for the cost function and the speech prior leads to more efficient control on the gain function values. Performance evaluation in different noisy scenarios demonstrates the superiority of the proposed methods over the existing parametric STSA estimators in terms of the achieved noise reduction and introduced speech distortion.

Highlights

Speech enhancement aims at the reduction of corrupting noise in speech signals while keeping the introduced speech distortion at the minimum possible level
We present a simple approach for the selection of the Generalized Gamma distribution (GGD) parameter c for the proposed short-time spectral amplitude (STSA) estimator
According to (22), the shape parameter c takes on its values as a linearly increasing function of the SNR in its possible range between cmin and cmax, leading to the appropriate adjustment of the estimator gain function based on the average power of the speech STSA components at each frame

Summary

Introduction

Speech enhancement aims at the reduction of corrupting noise in speech signals while keeping the introduced speech distortion at the minimum possible level. As experiments show, there may appear excessive distortion in the enhanced speech using the STSA estimator with this parameter choice, especially at high SNRs. we propose to use the adaptive approach in (11) as the basis for the selection of β, but to further apply the scheme in (12) as a form of frequency weighting to take into account the psycho-acoustics of the human auditory system within each time frame. According to (22), the shape parameter c takes on its values as a linearly increasing function of the SNR in its possible range between cmin and cmax, leading to the appropriate adjustment of the estimator gain function based on the average power of the speech STSA components at each frame. We employed the gain flooring scheme in (16) in cases where the proposed gain flooring is not used, since the closest results to

Proposed choice of α

Proposed choice of β

Conclusions

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: EURASIP Journal on Advances in Signal Processing	Publication Date: Oct 6, 2015
Citations: 27	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Bayesian STSA estimation using masking properties and generalized Gamma prior for speech enhancement

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURASIP Journal on Advances in Signal Processing

Lead the way for us

Similar Papers

Speech enhancement based on Bayesian decision and spectral amplitude estimation
Feng Deng ... Chang-Chun Bao
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2015
Feng Deng, et. al.Feng Deng ... Chang-Chun Bao
07 Oct 2015
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2015

Speech enhancement using optimal non-linear spectral amplitude estimation
Y Ephraim ... D Malah
-
Y Ephraim, et. al.Y Ephraim ... D Malah
01 Apr 1983
01 Apr 1983

A generalized log-spectral amplitude estimator for single-channel speech enhancement
Aleksej Chinaev ... Reinhold Haeb-Umbach
-
Aleksej Chinaev, et. al.Aleksej Chinaev ... Reinhold Haeb-Umbach
01 Mar 2017
01 Mar 2017

Speech enhancement using generalized maximum a posteriori spectral amplitude estimator
Yu-Cheng Su ... Yu Tsao
-
Yu-Cheng Su, et. al.Yu-Cheng Su ... Yu Tsao
01 May 2013
01 May 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bayesian STSA estimation using masking properties and generalized Gamma prior for speech enhancement

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURASIP Journal on Advances in Signal Processing