A Multi-Resolution Hidden Markov Model Using Class-Specific Features

Paul M Baggenstoss

doi:10.1109/tsp.2010.2052458

Abstract

We apply the PDF projection theorem to generalize the hidden Markov model (HMM) to accommodate multiple simultaneous segmentations of the raw data and multiple feature extraction transformations. Different segment sizes and feature transformations are assigned to each state. The algorithm averages over all allowable segmentations by mapping the segmentations to a “proxy” HMM and using the forward procedure. A by-product of the algorithm is the set of a posteriori state probability estimates that serve as a description of the input data. These probabilities have simultaneously the temporal resolution of the smallest processing windows and the processing gain and frequency resolution of the largest processing windows. The method is demonstrated on the problem of precisely modeling the consonant “T” in order to detect the presence of a distinct “burst” component. We compare the algorithm against standard speech analysis methods using data from the TIMIT corpus.

Highlights

The Hidden Markov Model (HMM) [1] combined with spectral analysis using cepstral coefficients [2] on fixed-length analysis windows remains at the forefront of automatic speech recognition (ASR) technology.
The need for a fixed-size window arises from the fundamental probabilistic approach that underlies the method, which depends on comparing likelihood functions formed on a common feature space.
One cannot directly compare two likelihood functions if they are defined on different feature spaces.
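
The difficulty noted in the last highlight is what the PDF projection theorem addresses. As a sketch in assumed notation (z_s = T_s(x) for the features extracted for state s, H_{0,s} for a state-specific reference hypothesis, and the subscript p on the projected PDF, which is a label introduced here), the commonly stated form of the theorem maps a feature-space PDF back to the raw-data space:

```latex
% PDF projection theorem (commonly stated form; notation is an assumption)
\begin{align}
  z_s &= T_s(x), \\
  J(x;\, T_s, H_{0,s}) &= \frac{p(x \mid H_{0,s})}{p(z_s \mid H_{0,s})}, \\
  p_p(x \mid s) &= J(x;\, T_s, H_{0,s})\; p(z_s \mid s).
\end{align}
```

Because each projected PDF is defined on the raw data x, likelihoods from states that use different features or segment sizes can be compared directly.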

Summary

INTRODUCTION

The Hidden Markov Model (HMM) [1] combined with spectral analysis using cepstral coefficients [2] on fixed-length analysis windows remains at the forefront of automatic speech recognition (ASR) technology. Although the value of L(X) calculated by the forward procedure operating on Pt,fq changes, it remains a valid joint PDF of X. We know this because all we have done is replace the conditional PDFs P(X|Q), which assume that all the segments are independent, with another PDF that assumes statistical dependence within the wait-state sequences associated with a given state. At this point we have a raw-data-based MRHMM model that we can compute efficiently using the forward procedure operating on Pt,fq. Let p(zs|s) be a PDF estimate of the feature set zs based on training data from state s. J(x; Ts, H0,s) has a simple form based on the Fisher information matrix [6].
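
The forward procedure mentioned above can be illustrated with a minimal sketch. This is the generic log-domain forward recursion for an ordinary HMM, not the paper's proxy-HMM construction; the function names `logsumexp` and `forward_log_likelihood` and the argument layout are assumptions made for illustration. In an MRHMM-style setting, the per-state log likelihoods passed in would presumably be projected (raw-data) PDF values rather than feature-space likelihoods.

```python
import math


def logsumexp(values):
    """Numerically stable log(sum(exp(v) for v in values))."""
    m = max(values)
    if m == -math.inf:
        # All terms are zero probability.
        return -math.inf
    return m + math.log(sum(math.exp(v - m) for v in values))


def forward_log_likelihood(log_pi, log_A, log_B):
    """Return log L(X) for a discrete-state HMM via the forward procedure.

    log_pi[i]   : log prior probability of state i
    log_A[i][j] : log transition probability from state i to state j
    log_B[t][j] : log likelihood of the observation at time t under state j
    Zero probabilities must be passed as -math.inf.
    """
    n_states = len(log_pi)
    # Initialization: alpha_0(j) = pi_j * b_j(x_0), in the log domain.
    log_alpha = [log_pi[j] + log_B[0][j] for j in range(n_states)]
    # Recursion: alpha_t(j) = [sum_i alpha_{t-1}(i) * a_ij] * b_j(x_t).
    for t in range(1, len(log_B)):
        log_alpha = [
            logsumexp([log_alpha[i] + log_A[i][j] for i in range(n_states)])
            + log_B[t][j]
            for j in range(n_states)
        ]
    # Termination: L(X) = sum_j alpha_T(j).
    return logsumexp(log_alpha)
```

Working in the log domain avoids the numerical underflow that the plain product form suffers on long sequences. The cost is proportional to the number of time steps times the square of the number of states.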

PRACTICAL IMPLEMENTATION DETAILS

Slave Partitions

Efficient Implementation

Simulated Data

Speech Data

Journal: IEEE Transactions on Signal Processing
Publication Date: Oct 1, 2010
Citations: 32
License type: cc-by
