Abstract

In this paper, we present a novel single-channel speech enhancement method in the Discrete Fourier Transform (DFT) domain. The amplitudes of the DFT coefficients of a clean speech signal are modeled by a Weibull probability density function (PDF). Measured by the Jensen-Shannon divergence (JSD), the Weibull distribution fits clean speech better than previously used distributions such as the gamma and Rayleigh. We therefore modify the Minimum Mean Square Error (MMSE) estimation algorithm for speech enhancement to use Weibull speech priors under additive Gaussian noise. The enhanced speech signals are assessed using the perceptual evaluation of speech quality (PESQ) and segmental signal-to-noise ratio (SEG-SNR) criteria. Extensive simulation experiments on speech signals degraded by various additive non-stationary noise sources show that using Weibull speech priors in the MMSE-based speech enhancement algorithm can improve performance over the Rayleigh and gamma PDFs.
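The JSD-based comparison of candidate priors can be illustrated with a minimal sketch. It is not the authors' procedure: synthetic Weibull samples stand in for clean-speech DFT amplitudes, the maximum-likelihood fit and the bin settings are assumptions, and the helper names (`weibull_bin_probs`, `fit_weibull`, `jsd`, `histogram`) are hypothetical. The Rayleigh model is expressed as a Weibull distribution with shape 2, which is an exact identity.

```python
import math
import random


def weibull_bin_probs(shape, scale, n_bins, width):
    """Bin probabilities under a Weibull model, renormalized to [0, n_bins * width)."""
    def cdf(x):
        return 1.0 - math.exp(-((x / scale) ** shape))
    probs = [cdf((i + 1) * width) - cdf(i * width) for i in range(n_bins)]
    total = sum(probs)
    return [p / total for p in probs]


def histogram(samples, n_bins, width):
    """Normalized histogram of the samples that fall inside [0, n_bins * width)."""
    counts = [0] * n_bins
    for s in samples:
        idx = int(s / width)
        if idx < n_bins:
            counts[idx] += 1
    total = sum(counts)
    return [c / total for c in counts]


def jsd(p, q):
    """Jensen-Shannon divergence in bits; lies in [0, 1] for base-2 logarithms."""
    m = [(a + b) / 2 for a, b in zip(p, q)]

    def kl(u, v):
        return sum(a * math.log2(a / b) for a, b in zip(u, v) if a > 0)

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def fit_weibull(samples):
    """Maximum-likelihood Weibull fit; the shape equation is solved by bisection."""
    logs = [math.log(s) for s in samples]
    mean_log = sum(logs) / len(logs)

    def score(k):
        powers = [s ** k for s in samples]
        weighted = sum(p * l for p, l in zip(powers, logs))
        return weighted / sum(powers) - 1.0 / k - mean_log

    lo, hi = 0.05, 10.0  # assumed search bracket for the shape parameter
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if score(mid) > 0:
            hi = mid
        else:
            lo = mid
    shape = 0.5 * (lo + hi)
    scale = (sum(s ** shape for s in samples) / len(samples)) ** (1.0 / shape)
    return shape, scale


# Synthetic stand-in for clean-speech DFT amplitudes (assumption, not real data).
random.seed(0)
samples = [random.weibullvariate(1.0, 0.8) for _ in range(5000)]
samples = [s for s in samples if s > 0.0]  # guard against log(0) in the fit

n_bins, width = 60, 0.1  # assumed histogram settings
empirical = histogram(samples, n_bins, width)

shape, scale = fit_weibull(samples)
# Rayleigh ML fit, written as a Weibull with shape 2 and scale sqrt(mean(x^2)).
rayleigh_scale = math.sqrt(sum(s * s for s in samples) / len(samples))

jsd_weibull = jsd(empirical, weibull_bin_probs(shape, scale, n_bins, width))
jsd_rayleigh = jsd(empirical, weibull_bin_probs(2.0, rayleigh_scale, n_bins, width))
print(f"fitted Weibull shape={shape:.3f}, scale={scale:.3f}")
print(f"JSD Weibull={jsd_weibull:.5f}, JSD Rayleigh={jsd_rayleigh:.5f}")
```

A lower JSD indicates a closer match between the model and the empirical amplitude histogram. Because the samples here are drawn from a Weibull distribution, the sketch only demonstrates the comparison mechanics and says nothing about real speech.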
