The inaccurate estimates of the speech and noise linear prediction coefficients (LPCs) introduce bias in augmented Kalman filter (AKF) gain, which impacts the quality and intelligibility of enhanced speech. Although current tuning methods offset the bias in AKF gain, particularly in colored noise conditions, they do not adequately address nonstationary noise conditions. This paper introduces a new tuning algorithm of the AKF gain for speech enhancement in real-life noise conditions. Due to this purpose, a speech presence probability (SPP) method first estimates the noise power spectral density (PSD) from each noisy speech frame to compute the noise LPC parameters. A whitening filter is constructed with the noise LPCs to pre-whiten each noisy speech frame prior to computing the speech LPC parameters. The AKF is then constructed with the estimated speech and noise LPC parameters. To achieve better noise reduction, the robustness metric is employed to dynamically offset the bias in AKF gain during speech absence of the noisy speech to that of the sensitivity metric during speech presence. The speech activity is obtained through adopting the speech and noise production model parameters. It is shown that the reduced-biased AKF gain achieved by the proposed tuning algorithm addresses speech enhancement in real-life noise conditions. Objective and subjective scores on the NOIZEUS corpus demonstrate that the proposed method produces enhanced speech with higher quality and intelligibility than the competing methods in real-life noise conditions for a wide range of signal-to-noise ratio (SNR) levels.
Read full abstract