Non-Stationary Hidden Markov Models for Speech Recognition

Don X Sun, Li Deng

doi:10.1007/978-1-4612-4056-3_10

Abstract

The standard hidden Markov model (HMM) assumes local, or state-conditioned, stationarity of the signals being modeled. In this article, we present some recent developments in generalizing the standard HMM to incorporate local dynamic patterns as well as global non-stationarity for speech signal modeling. The major component of the proposed non-stationary HMMs is the set of parametric regression models for individual HMM states. The regression functions are intended to characterize the dynamic movements of the signals within an HMM state. Both the EM algorithm (or Baum-Welch algorithm) and the segmental K-means algorithm are generalized to accommodate the complex state duration information needed for the estimation of regression parameters. To allow for the flexibility of linear time warping in individual HMM states, an efficient algorithm is developed with the use of token-dependent auxiliary parameters. Although the auxiliary parameters are of no interest in themselves for modeling speech sound patterns, they provide an intermediate tool for achieving maximal accuracy in estimating the parameters of the regression models.
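As a rough illustration of a state-level regression model, the sketch below scores a segment of scalar observations against a single state whose mean follows a trend over the time elapsed since the state was entered. The polynomial form of the trend, the i.i.d. Gaussian residual, and the names `trend_mean` and `segment_log_likelihood` are assumptions made for this example; the abstract does not specify the regression family, and the time-warping auxiliary parameters are not modeled here.

```python
import math


def trend_mean(coeffs, elapsed):
    """Assumed polynomial trend: sum over p of coeffs[p] * elapsed**p."""
    return sum(b * elapsed ** p for p, b in enumerate(coeffs))


def segment_log_likelihood(obs, coeffs, variance):
    """Log-likelihood of a scalar segment assigned to one state.

    The residual obs[t] - trend_mean(coeffs, t) is assumed to be i.i.d.
    zero-mean Gaussian with the given variance, where t counts frames
    since the state was entered.
    """
    log_norm = -0.5 * math.log(2.0 * math.pi * variance)
    total = 0.0
    for t, o in enumerate(obs):
        resid = o - trend_mean(coeffs, t)
        total += log_norm - 0.5 * resid * resid / variance
    return total


# A steadily rising segment scored under two hypothetical states:
# one with a linear trend, one with a constant (stationary) mean.
segment = [1.0, 2.0, 3.0, 4.0]
ll_trend = segment_log_likelihood(segment, [1.0, 1.0], 1.0)
ll_const = segment_log_likelihood(segment, [2.5], 1.0)
```

With a single coefficient the trend reduces to a constant mean, which corresponds to the state-conditioned stationarity of the standard HMM; for this rising segment the linear trend yields the higher log-likelihood.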
