Instantaneous PSD Estimation for Speech Enhancement based on Generalized Principal Components

Thomas Dietzen,Toon Van Waterschoot,Marc Moonen

doi:10.23919/eusipco47968.2020.9287839

Abstract

Power spectral density (PSD) estimates of various microphone signal components are essential to many speech enhancement procedures. As speech is highly non-nonstationary, performance improvements may be gained by maintaining time-variations in PSD estimates. In this paper, we propose an instantaneous PSD estimation approach based on generalized principal components. Similarly to other eigenspace-based PSD estimation approaches, we rely on recursive averaging in order to obtain a microphone signal correlation matrix estimate to be decomposed. However, instead of estimating the PSDs directly from the temporally smooth generalized eigenvalues of this matrix, yielding temporally smooth PSD estimates, we propose to estimate the PSDs from newly defined instantaneous generalized eigenvalues, yielding instantaneous PSD estimates. The instantaneous generalized eigenvalues are defined from the generalized principal components, i.e. a generalized eigenvector-based transform of the microphone signals. We further show that the smooth generalized eigenvalues can be understood as a recursive average of the instantaneous generalized eigenvalues. Simulation results comparing the multi-channel Wiener filter (MWF) with smooth and instantaneous PSD estimates indicate better speech enhancement performance for the latter. A MATLAB implementation is available online.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Instantaneous PSD Estimation for Speech Enhancement based on Generalized Principal Components

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Maximum likelihood PSD estimation for speech enhancement in reverberant and noisy conditions
Adam Kuklasinski ... Jesper Jensen
-
Adam Kuklasinski, et. al.Adam Kuklasinski ... Jesper Jensen
01 Mar 2016
01 Mar 2016

Improved Speech Enhancement Considering Speech PSD Uncertainty
Minseung Kim ... Jong Won Shin
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 30
Minseung Kim, et. al.Minseung Kim ... Jong Won Shin
01 Jan 2021
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 30

Analysis of Eigenvalue Decomposition-Based Late Reverberation Power Spectral Density Estimation
Ina Kodrasi ... Simon Doclo
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 26
Ina Kodrasi, et. al.Ina Kodrasi ... Simon Doclo
01 Jun 2018
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 26

Maximum Likelihood PSD Estimation for Speech Enhancement in Reverberation and Noise
Adam Kuklasinski ... Soren Holdt Jensen
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 24
Adam Kuklasinski, et. al.Adam Kuklasinski ... Soren Holdt Jensen
03 Jul 2016
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 24

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Instantaneous PSD Estimation for Speech Enhancement based on Generalized Principal Components

Abstract

Talk to us

Similar Papers