Speech Presence Probability Estimation Research Articles

In this paper, we present and compare novel algorithms to localize simultaneous speakers using four microphones distributed on a pair of binaural hearing aids. The framework consists of two groups of localization algorithms, namely, beamforming-based and statistical-model-based localization algorithms. We first generalize our previously proposed methods based on beamforming techniques to the binaural configuration with 2 $\times$ 2 microphones. Next, we contribute two statistical-model-based methods for binaural localization using the maximum likelihood approach, which also take head-related transfer functions and unknown noise conditions into account. The methods enable the localization of multiple source positions for all azimuth angles and do not require prior training of binaural cues. The proposed localization algorithms are integrated into a generalized side-lobe canceller (GSC) to extract the desired speaker in the presence of competing speakers and background noise, including when the listener's head turns. The GSC components are adapted with the frequency-wise target presence probability and the frame-wise broadband direction-of-arrival (DOA) estimates that track the turns of the listener's head. We evaluate the performance of the localization algorithms individually and also in the context of the adaptive binaural beamformer in various noisy and reverberant conditions. Finally, we introduce a new adaptive beamformer, which combines the GSC with multichannel speech presence probability estimation and achieves superior source separation performance in noisy environments.

A reliable speech presence probability (SPP) estimator is important to many frequency-domain speech enhancement algorithms. It is known that a good estimate of SPP can be obtained by having a smooth a-posteriori signal-to-noise ratio (SNR) function, which can be achieved by reducing the noise variance when estimating the speech power spectrum. Recently, wavelet denoising with the multitaper spectrum (MTS) estimation technique was suggested for this purpose. However, traditional approaches directly use the wavelet shrinkage denoiser, which has not been fully optimized for denoising the MTS of noisy speech signals. In this paper, we first propose a two-stage wavelet denoising algorithm for estimating the speech power spectrum. In the first stage, we apply the wavelet transform to the periodogram of a noisy speech signal. Using the resulting wavelet coefficients, an oracle is developed to indicate the approximate locations of the noise floor in the periodogram. In the second stage, we use this oracle to selectively remove the wavelet coefficients of the noise floor in the log MTS of the noisy speech. The remaining wavelet coefficients are then used to reconstruct a denoised MTS and, in turn, to generate a smooth a-posteriori SNR function. To adapt to the enhanced a-posteriori SNR function, we further propose a new method to estimate the generalized likelihood ratio (GLR), which is an essential parameter for SPP estimation. Simulation results show that the new SPP estimator outperforms the traditional approaches and improves both the quality and intelligibility of the enhanced speech.
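To make the link between the a-posteriori SNR, the GLR, and the SPP concrete, the following is a minimal sketch for a single time-frequency bin. It is not the estimator proposed in the abstract above: it assumes the textbook complex Gaussian model with a fixed a-priori SNR and a fixed prior probability of speech, and the function name, parameter names, and default values (15 dB, 0.5) are illustrative choices rather than anything stated in the abstracts.

```python
import math


def speech_presence_probability(noisy_power, noise_power,
                                prior_snr_db=15.0, prior_speech=0.5):
    """Posterior SPP for one time-frequency bin (complex Gaussian model).

    noisy_power  : |Y|^2, power of the noisy spectral coefficient
    noise_power  : estimated noise power in the same bin
    prior_snr_db : fixed a-priori SNR under speech presence (assumed value)
    prior_speech : prior probability of speech presence (assumed value)
    """
    xi = 10.0 ** (prior_snr_db / 10.0)   # a-priori SNR, linear scale
    gamma = noisy_power / noise_power    # a-posteriori SNR

    # Likelihood ratio p(Y | speech) / p(Y | noise only) for complex
    # Gaussian coefficients, weighted by the prior odds of speech presence.
    exponent = min(gamma * xi / (1.0 + xi), 700.0)  # clamp to avoid overflow
    glr = (prior_speech / (1.0 - prior_speech)) * math.exp(exponent) / (1.0 + xi)

    # Bayes' rule: posterior probability from the (generalized) likelihood ratio.
    return glr / (1.0 + glr)


if __name__ == "__main__":
    for gamma in (0.5, 1.0, 5.0, 20.0):
        spp = speech_presence_probability(gamma, 1.0)
        print(f"a-posteriori SNR {gamma:5.1f} -> SPP {spp:.3f}")
```

Because the SPP depends on the a-posteriori SNR through an exponential, random fluctuations in the spectrum estimate translate directly into fluctuations of the SPP, which is why a smoother a-posteriori SNR function is desirable.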

Articles published on Speech Presence Probability Estimation

Incorporation of a modified temporal cepstrum smoothing in both signal-to-noise ratio and speech presence probability estimation for speech enhancement

An Analysis of Traditional Noise Power Spectral Density Estimators Based on the Gaussian Stochastic Volatility Model

Dual Microphone Speech Enhancement Based on Statistical Modeling of Interchannel Phase Difference

Speech enhancement using modified wiener filter based MMSE and speech presence probability estimation

Distributed Speech Presence Probability Estimator in Fully Connected Wireless Acoustic Sensor Networks

Model-based distributed node clustering and multi-speaker speech presence probability estimation in wireless acoustic sensor networks.

Dual-Channel Speech Enhancement Based on Extended Kalman Filter Relative Transfer Function Estimation

Binaural Speaker Localization Integrated Into an Adaptive Beamformer for Hearing Aids

New Results in Modulation-Domain Single-Channel Speech Enhancement

Speech enhancement via two-stage dual tree complex wavelet packet transform with a speech presence probability estimator.

Wavelet Packet Transform based Speech Enhancement via Two-Dimensional SPP Estimator with Generalized Gamma Priors

Wavelet based speech presence probability estimator for speech enhancement

Gaussian Model-Based Multichannel Speech Presence Probability

Improved A Posteriori Speech Presence Probability Estimation Based on a Likelihood Ratio With Fixed Priors
