Bayesian marginal statistics for speech enhancement using log Gabor wavelet

Suman Senapati,Neeraj Bhende,Goutam Saha

doi:10.1007/s10772-011-9096-2

Abstract

This paper deals with single-channel speech enhancement technique. Initially, the suitability of Log Gabor Wavelet (LGW) is investigated in speech enhancement approach and a novel speech enhancer by Bayesian Maximum a Posteriori (MAP) based Marginal Statistical Characterization (MSC) is developed. The LGW filters are traditional choice for obtaining localized frequency information and these offer the best simultaneous localization of time and frequency information. The MSC is applied in each scale of the LGW, that means a level dependent shrinkage rule is taken to suppress the background perturbations. The pdf of the LGW filtered speech coefficient is modeled with Generalized Laplacian Distribution (GLD), which allows a high approximation accuracy for Laplace distributed real and imaginary parts of the speech coefficients. The robustness of the proposed framework is tested on NOIZEUS speech corpus against seven different established speech enhancement algorithms. Experimental results show that the proposed estimator yield a higher improvement in Segmental SNR (S-SNR), lower Log Area Ratio (LAR) and Weighted Spectral Slope (WSS) distortion compared to existing speech enhancement algorithms.

Full Text