Gaussian Scale Mixture Model Research Articles

This paper presents a novel probabilistic approach to speech enhancement. Instead of a deterministic logarithmic relationship, we assume a probabilistic relationship between the frequency coefficients and the log-spectra. The speech model in the log-spectral domain is a Gaussian mixture model (GMM). The frequency coefficients obey a zero-mean Gaussian whose covariance equals to the exponential of the log-spectra. This results in a Gaussian scale mixture model (GSMM) for the speech signal in the frequency domain, since the log-spectra can be regarded as scaling factors. The probabilistic relation between frequency coefficients and log-spectra allows these to be treated as two random variables, both to be estimated from the noisy signals. Expectation-maximization (EM) was used to train the GSMM and Bayesian inference was used to compute the posterior signal distribution. Because exact inference of this full probabilistic model is computationally intractable, we developed two approaches to enhance the efficiency: the Laplace method and a variational approximation. The proposed methods were applied to enhance speech corrupted by Gaussian noise and speech-shaped noise (SSN). For both approximations, signals reconstructed from the estimated frequency coefficients provided higher signal-to-noise ratio (SNR) and those reconstructed from the estimated log-spectra produced lower word recognition error rate because the log-spectra fit the inputs to the recognizer better. Our algorithms effectively reduced the SSN, which algorithms based on spectral analysis were not able to suppress.

Read full abstract

The ill-posed nature of the MEG (or related EEG) source localization problem requires the incorporation of prior assumptions when choosing an appropriate solution out of an infinite set of candidates. Bayesian approaches are useful in this capacity because they allow these assumptions to be explicitly quantified using postulated prior distributions. However, the means by which these priors are chosen, as well as the estimation and inference procedures that are subsequently adopted to affect localization, have led to a daunting array of algorithms with seemingly very different properties and assumptions. From the vantage point of a simple Gaussian scale mixture model with flexible covariance components, this paper analyzes and extends several broad categories of Bayesian inference directly applicable to source localization including empirical Bayesian approaches, standard MAP estimation, and multiple variational Bayesian (VB) approximations. Theoretical properties related to convergence, global and local minima, and localization bias are analyzed and fast algorithms are derived that improve upon existing methods. This perspective leads to explicit connections between many established algorithms and suggests natural extensions for handling unknown dipole orientations, extended source configurations, correlated sources, temporal smoothness, and computational expediency. Specific imaging methods elucidated under this paradigm include the weighted minimum ℓ2-norm, FOCUSS, minimum current estimation, VESTAL, sLORETA, restricted maximum likelihood, covariance component estimation, beamforming, variational Bayes, the Laplace approximation, and automatic relevance determination, as well as many others. Perhaps surprisingly, all of these methods can be formulated as particular cases of covariance component estimation using different concave regularization terms and optimization rules, making general theoretical analyses and algorithmic extensions/improvements particularly relevant.

Read full abstract

Gaussian Scale Mixture Model Research Articles

Related Topics

Articles published on Gaussian Scale Mixture Model

Speech Enhancement Using Gaussian Scale Mixture Models.

Video Denoising Based on a Spatiotemporal Gaussian Scale Mixture Model

Vonn distribution of relative phase for statistical image modeling in complex wavelet domain

A New Image Denoising Method Combining the Nonsubsampled Contourlet Transform and Adaptive Total Variation

Wavelet-Based EM Algorithm for Multispectral-Image Restoration

SAR Image Denoising Based on Lifting Directionlet Domain Gaussian Scale Mixtures Model

Security Analysis on Add-SS Watermarking with GSM

Perceptual organization in the tilt illusion.

A unified Bayesian framework for MEG/EEG source imaging

Denoising of multicomponent images using wavelet least-squares estimators

Wavelet Denoising of Multicomponent Images Using Gaussian Scale Mixture Models and a Noise-Free Image as Priors

Optimality of KLT for High-Rate Transform Coding of Gaussian Vector-Scale Mixtures: Application to Reconstruction, Estimation, and Classification

Topographic Product Models Applied to Natural Scene Statistics

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Gaussian Scale Mixture Model Research Articles

Related Topics

Articles published on Gaussian Scale Mixture Model

Speech Enhancement Using Gaussian Scale Mixture Models.

Video Denoising Based on a Spatiotemporal Gaussian Scale Mixture Model

Vonn distribution of relative phase for statistical image modeling in complex wavelet domain

A New Image Denoising Method Combining the Nonsubsampled Contourlet Transform and Adaptive Total Variation

Wavelet-Based EM Algorithm for Multispectral-Image Restoration

SAR Image Denoising Based on Lifting Directionlet Domain Gaussian Scale Mixtures Model

Security Analysis on Add-SS Watermarking with GSM

Perceptual organization in the tilt illusion.

A unified Bayesian framework for MEG/EEG source imaging

Denoising of multicomponent images using wavelet least-squares estimators

Wavelet Denoising of Multicomponent Images Using Gaussian Scale Mixture Models and a Noise-Free Image as Priors

Optimality of KLT for High-Rate Transform Coding of Gaussian Vector-Scale Mixtures: Application to Reconstruction, Estimation, and Classification

Topographic Product Models Applied to Natural Scene Statistics