Constrained Iterative Speech Enhancement Using Phonetic Classes

Amit Das, John H. L. Hansen

doi:10.1109/tasl.2012.2191282

Abstract

The degree to which noise affects phonemes is not uniform, since it depends on their distinct acoustic properties. In this study, the problem of selectively enhancing speech based on broad phoneme classes is addressed using Auto-LSP, a constrained iterative speech enhancement algorithm. Multiple enhanced utterances are generated for every noisy utterance by varying the Auto-LSP parameters. The noisy utterance is then partitioned into segments based on broad phoneme classes, and constraints are applied to each segment using a hard decision solution. To alleviate the effect of hard decision errors, a Gaussian mixture model (GMM)-based maximum-likelihood (ML) soft decision solution is also presented. The resulting utterances are evaluated on the TIMIT speech corpus using the Itakura-Saito, segmental signal-to-noise ratio (SNR), and perceptual evaluation of speech quality (PESQ) metrics over four noise types at three SNR levels. Comparative assessment against baseline enhancement algorithms such as Auto-LSP, log-minimum mean squared error (log-MMSE), and log-MMSE with speech presence uncertainty (log-MMSE-SPU) demonstrates that the proposed solution exhibits greater consistency in improving speech quality over most phoneme classes and noise types considered in this study.
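As an illustration of one of the evaluation metrics named in the abstract, the following is a minimal sketch of segmental SNR under commonly used conventions. The function name `segmental_snr`, the frame length of 256 samples, and the per-frame clamping range of -10 dB to 35 dB are assumptions chosen for the example; the abstract does not state the settings used in the paper.

```python
import math


def segmental_snr(clean, enhanced, frame_len=256, min_db=-10.0, max_db=35.0):
    """Mean frame-wise SNR in dB, each frame clamped to [min_db, max_db].

    frame_len, min_db and max_db are illustrative defaults (assumptions),
    not values taken from the paper.
    """
    if len(clean) != len(enhanced):
        raise ValueError("signals must have the same length")
    n_frames = len(clean) // frame_len  # trailing partial frame is ignored
    if n_frames == 0:
        raise ValueError("signal is shorter than one frame")

    eps = 1e-12  # guards against log(0) and division by zero
    total = 0.0
    for i in range(n_frames):
        start = i * frame_len
        s = clean[start:start + frame_len]
        e = enhanced[start:start + frame_len]
        signal_energy = sum(x * x for x in s)
        error_energy = sum((x - y) ** 2 for x, y in zip(s, e))
        snr_db = 10.0 * math.log10((signal_energy + eps) / (error_energy + eps))
        # Clamp so silent or perfectly matched frames do not dominate the mean.
        total += min(max(snr_db, min_db), max_db)
    return total / n_frames
```

With these defaults, an enhanced signal identical to the clean reference reaches the upper clamp of 35 dB in every frame, while a uniform 10% amplitude error gives roughly 20 dB. Averaging per-frame values rather than computing one global ratio keeps loud frames from masking degradation in quiet ones, which is why the metric is often preferred for comparing enhancement quality across different speech segments.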

Journal: IEEE Transactions on Audio, Speech, and Language Processing
Publication Date: Aug 1, 2012
Citations: 20

Similar Papers

A Multi-Band Speech Enhancement Algorithm Exploiting Iterative Processing for Enhancement of Single Channel Speech
Navneet Upadhyay ... Abhijit Karmakar
Journal of Signal and Information Processing | VOL. 04
01 Jan 2013

Multichannel MMSE Wiener Filter Using Complex Real and Imaginary Spectral Coefficients for Distributed Microphone Speech Enhancement
20 Dec 2016

MMSE Log-Spectral Amplitude Estimation for Single Channel Speech Enhancement Under Speech Presence Uncertainty by Weibull Speech Priors
Mojtaba Bahrami ... Sanaz Seyedin
01 May 2018

Improved speech absence probability estimation based on environmental noise classification
Young-Ho Son ... Sang-Min Lee
Journal of Central South University | VOL. 19
01 Sep 2012
