HMM-Based Reconstruction of Unreliable Spectrographic Data for Noise Robust Speech Recognition

Bengt J Borgström,Abeer Alwan

doi:10.1109/tasl.2009.2038811

Abstract

This paper presents a framework for efficient HMM-based estimation of unreliable spectrographic speech data. It discusses the role of hidden Markov models (HMMs) during minimum mean-square error (MMSE) spectral reconstruction. We develop novel HMM-based reconstruction algorithms which exploit intra-channel (across-time) correlation and/or inter-channel (across-frequency) correlation. For the sake of computational efficiency, this paper utilizes approximations to HMM-based decoding methods by developing models constructed from lower resolution quantizers. State configurations for lower resolution models are obtained through a tree-structured mapping of quantizer centroids, and model parameters are adapted accordingly. HMM downsampling avoids expensive retraining of models, and eliminates unnecessary memory requirements. Explicit general formulae are presented for the adaptation of steady-state and transitional statistics. Adaptation of observation statistics are derived from stochastic models of noise spectral magnitude estimation accuracies. The proposed estimation methods are applied in combination with oracle masks, which provide an upper performance bound, as well as masks derived from speech presence probability, which represent a more realistic scenario. Both methods are shown to boost noise robust recognition accuracies significantly relative to the Mel-frequency cepstral coefficient (MFCC) baseline system. Furthermore, HMM downsampling greatly reduces the complexity of the HMM-based reconstruction method while negligibly affecting results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

HMM-Based Reconstruction of Unreliable Spectrographic Data for Noise Robust Speech Recognition

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech, and Language Processing

Lead the way for us

Journal: IEEE Transactions on Audio, Speech, and Language Processing	Publication Date: Aug 1, 2010
Citations: 49

Similar Papers

A novel fast nonstationary noise tracking approach based on MMSE spectral power estimator
Qiquan Zhang ... Muhammad Idrees
Digital Signal Processing | VOL. 88
Qiquan Zhang, et. al.Qiquan Zhang ... Muhammad Idrees
05 Feb 2019
Digital Signal Processing | VOL. 88

Codebook-based speech enhancement with Bayesian LP parameters estimation
Qing Wang ... Chang-Chun Bao
-
Qing Wang, et. al.Qing Wang ... Chang-Chun Bao
01 Dec 2015
01 Dec 2015

Using the turbo principle for exploiting temporal and spectral correlations in speech presence probability estimation
Dang Hai Tran Vu ... Reinhold Haeb-Umbach
-
Dang Hai Tran Vu, et. al.Dang Hai Tran Vu ... Reinhold Haeb-Umbach
01 May 2013
01 May 2013

HMM based isolated Kannada digit recognition system using MFCC
H Muralikrishna ... Kumara Shama
-
H Muralikrishna, et. al.H Muralikrishna ... Kumara Shama
01 Aug 2013
01 Aug 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

HMM-Based Reconstruction of Unreliable Spectrographic Data for Noise Robust Speech Recognition

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech, and Language Processing