Abstract
The interchannel phase difference (IPD) may be one of the most widely-used spatial cues in multichannel speech processing, and has been used in beamformers and post filters for speech enhancement. The coherence, which is also used as a feature for speech enhancement, can provide information on the reliability of the IPD for the estimation of the speech presence probability (SPP). In this paper, we propose dual microphone speech enhancement adopting <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">a posteriori</i> SPP estimation based on statistical modeling of the IPD. The marginal distribution of the IPD is derived from the distribution of the relative transfer function which is parameterized with the IPD and coherence, with a single assumption that the observed discrete Fourier transform (DFT) coefficients in each frequency are distributed according to a complex bivariate Gaussian distribution. Given the direction of arrival of the desired signal, the <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">a posteriori</i> SPP is obtained using the IPD distributions with and without the information on the location of the interfering source, and is applied to speech enhancement. Experimental results for various types and locations of noise, signal-to-noise ratios, reverberation times, and locations of the target source showed that the proposed method outperformed previously proposed approaches utilizing IPD information.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.