Noise Power Spectral Density Estimation Research Articles

Speech enhancement based on statistical models has been studied for several decades. Recently, the speech enhancement adopting a speech power spectral density (PSD) uncertainty model has been proposed. This approach distinguishes the true speech PSD from its estimate and considers both as random variables. It incorporates a prior distribution of speech spectra and speech PSD estimators to derive the PSD uncertainty-aware counterpart to conventional clean speech estimators, which results in performance improvement. However, the speech PSD uncertainty model has not yet been adopted for parameter estimations such as <inline-formula><tex-math notation="LaTeX"><?TeX $\mathit{a posteriori}$?></tex-math></inline-formula> speech presence probability, noise PSD, and speech power spectra estimations in the speech enhancement framework. In this paper, we incorporate the speech PSD uncertainty model to all the components of the statistical model-based speech enhancement framework by deriving PSD uncertainty-aware counterparts to conventional parameter estimators. Specifically, we derive the <inline-formula><tex-math notation="LaTeX"><?TeX $\mathit{a posteriori}$?></tex-math></inline-formula> speech presence probability (SPP) where the likelihood function for each hypothesis is based on the speech PSD uncertainty. With this <inline-formula><tex-math notation="LaTeX"><?TeX $\mathit{a posteriori}$?></tex-math></inline-formula> SPP, a novel SPP-based noise PSD estimator is derived. Also, we derive the minimum mean-square error (MMSE) estimator for the power spectrum of the clean speech in the current frame under speech PSD uncertainty which is exploited to refine the speech PSD estimator. Finally, the refined speech PSD estimator is incorporated into the spectral gain function based on the speech PSD uncertainty model. The proposed approach showed improved noise PSD estimation performance in terms of the averaged logarithmic error distance, and improved speech enhancement performance in terms of the noise reduction, segmental signal-to-noise ratio, perceptual evaluation of speech quality (PESQ) scores and short-time objective intelligibility in our experiments. It also exhibited comparable performance with a real-time deep learning-based speech enhancement system in terms of the PESQ scores and composite measures for the VoiceBank-DEMAND dataset.

Read full abstract

Electrolaryngeal speech, for persons who have lost their larynx, suffers from the drawback of susceptibility to acoustic noise, which includes inherent electrolarynx motor noise as well as environmental noise. Interactions with electrolarynx users motivated the authors to investigate a crucial drawback of electrolaryngeal speech: degradation of electrolaryngeal speech under noisy environments. The effect of contemporary methods of speech enhancement, viz. noise power spectral density estimation based d-dimensional amplitude trimmed estimation (DATE), and non-negative matrix factorization (NMF) based algorithms, were studied and evaluated for electrolaryngeal speech degraded by noise. Electrolaryngeal speech was corrupted using three types of noisy scenarios at low signal to noise ratios (0, −5 and −10 dB SNR). Objective testing based on the perceptual evaluation of speech quality (PESQ) standard, as well as subjective testing by 14 participants based on the ITU-T P.835 standard were performed. Word-centric intelligibility testing was also performed. The results indicated an improvement in the speech quality. The subjective testing results were further analyzed using one-way analysis of variance (ANOVA) and multiple paired comparison using Tukey’s honest significant difference (HSD) criterion. The overall speech quality of NMF algorithms was found to be significantly higher than that of DATE-based algorithms for the test conditions. The results show that speech enhancement algorithms can aid in improving the electrolaryngeal user experience by reducing the effect of acoustic noise.

Read full abstract

Noise Power Spectral Density Estimation Research Articles

Related Topics

Articles published on Noise Power Spectral Density Estimation

Postfilter for Dual Channel Speech Enhancement Using Coherence and Statistical Model-Based Noise Estimation.

An Analysis of Traditional Noise Power Spectral Density Estimators Based on the Gaussian Stochastic Volatility Model

Rotor Noise-Aware Noise Covariance Matrix Estimation for Unmanned Aerial Vehicle Audition

Multi-sensory sound source enhancement for unmanned aerial vehicle recordings

Improved Speech Enhancement Considering Speech PSD Uncertainty

Low-complexity artificial noise suppression methods for deep learning-based speech enhancement algorithms

Evaluation of speech enhancement algorithms applied to electrolaryngeal speech degraded by noise

Gravitational-wave astronomy with an uncertain noise power spectral density

Optimum step-size control for a variable step-size stereo acoustic echo canceller in the frequency domain

DeepMMSE: A Deep Learning Approach to MMSE-Based Noise Power Spectral Density Estimation

Microphone Array Wiener Post Filtering Using Monotone Operator Splitting

A novel fast nonstationary noise tracking approach based on MMSE spectral power estimator

Bone-Conduction Sensor Assisted Noise Estimation for Improved Speech Enhancement.

Robust noise power spectral density estimation for binaural speech enhancement in time-varying diffuse noise field

A novel regularization framework for transient noise reduction

An Analysis of Adaptive Recursive Smoothing with Applications to Noise PSD Estimation

잡음 파워 스펙트럼 밀도 추정을 이용한 서로소 배열과 프로퍼게이터 기법 기반의 향상된 도래각 추정 기법

Recent Developments in Speech Enhancement in the Short-Time Fourier Transform Domain

A Constrained MMSE LP Residual Estimator for Speech Dereverberation in Noisy Environments

An adaptive noise power spectral density estimation of noisy speech using generalized gamma probability density function

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Noise Power Spectral Density Estimation Research Articles

Related Topics

Articles published on Noise Power Spectral Density Estimation

Postfilter for Dual Channel Speech Enhancement Using Coherence and Statistical Model-Based Noise Estimation.

An Analysis of Traditional Noise Power Spectral Density Estimators Based on the Gaussian Stochastic Volatility Model

Rotor Noise-Aware Noise Covariance Matrix Estimation for Unmanned Aerial Vehicle Audition

Multi-sensory sound source enhancement for unmanned aerial vehicle recordings

Improved Speech Enhancement Considering Speech PSD Uncertainty

Low-complexity artificial noise suppression methods for deep learning-based speech enhancement algorithms

Evaluation of speech enhancement algorithms applied to electrolaryngeal speech degraded by noise

Gravitational-wave astronomy with an uncertain noise power spectral density

Optimum step-size control for a variable step-size stereo acoustic echo canceller in the frequency domain

DeepMMSE: A Deep Learning Approach to MMSE-Based Noise Power Spectral Density Estimation

Microphone Array Wiener Post Filtering Using Monotone Operator Splitting

A novel fast nonstationary noise tracking approach based on MMSE spectral power estimator

Bone-Conduction Sensor Assisted Noise Estimation for Improved Speech Enhancement.

Robust noise power spectral density estimation for binaural speech enhancement in time-varying diffuse noise field

A novel regularization framework for transient noise reduction

An Analysis of Adaptive Recursive Smoothing with Applications to Noise PSD Estimation

잡음 파워 스펙트럼 밀도 추정을 이용한 서로소 배열과 프로퍼게이터 기법 기반의 향상된 도래각 추정 기법

Recent Developments in Speech Enhancement in the Short-Time Fourier Transform Domain

A Constrained MMSE LP Residual Estimator for Speech Dereverberation in Noisy Environments

An adaptive noise power spectral density estimation of noisy speech using generalized gamma probability density function