Conditional MMSE-based single-channel speech enhancement using inter-frame and inter-band correlations

Hajar Momeni,Emanuel A P Habets,Hamid Reza Abutalebi

doi:10.1109/icassp.2016.7472672

Abstract

Obtaining an estimate of clean speech for each time-frequency (TF) unit continues to be of importance in single-channel speech enhancement. Recently, it has been proposed to exploit inter-frame and interband correlations in a variety of speech processing applications. To estimate the clean speech, we propose in this contribution a conditional minimum mean squared error (MMSE)-based filter which exploits both inter-frame and inter-band correlations and takes into account the speech presence uncertainty. The speech presence uncertainty is provided by a recently proposed a posteriori speech presence probability (SPP) estimator that can also take into account the inter-frame and inter-band correlations. Simulation results demonstrate that the conditional MMSE-based filter in combination with the previously proposed SPP estimator and a fixed a priori SPP results in less distorted speech compared to the other SPP estimators.

Full Text