Abstract

The authors introduce a novel approach to modeling variable-duration phonemes, called the stochastic segment model. A phoneme X is observed as a variable-length sequence of frames, where each frame is represented by a parameter vector and the length of the sequence is random. The stochastic segment model consists of (1) a time warping of the variable-length segment X into a fixed-length segment Y called a resampled segment and (2) a joint density function of the parameters of X, which in this study is a Gaussian density. The segment model represents spectral/temporal structure over the entire phoneme. The model also allows the incorporation in Y of acoustic-phonetic features derived from X, in addition to the usual spectral features that have been used in hidden Markov modeling and dynamic time warping approaches to speech recognition. The authors describe the stochastic segment model, the recognition algorithm, and an iterative training algorithm for estimating segment models from continuous speech. They present several results using segment models in two speaker-dependent recognition tasks and compare the performance of the stochastic segment model to that of hidden Markov models.
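The abstract describes two steps: warping each variable-length segment to a fixed-length resampled segment, and scoring that resampled segment with a joint Gaussian density. The sketch below illustrates that idea only; the resampling scheme (linear interpolation to M frames), the regularization, and all function names are illustrative assumptions, not details taken from the paper.

```python
# Minimal sketch of a segment-level Gaussian model, assuming linear-interpolation
# resampling; not the authors' implementation.
import numpy as np


def resample_segment(X, M):
    """Time-warp a (T, D) segment X to a fixed-length (M, D) segment Y
    by linear interpolation along the time axis."""
    T, D = X.shape
    src = np.linspace(0.0, T - 1, M)  # target positions in source time
    return np.vstack(
        [np.interp(src, np.arange(T), X[:, d]) for d in range(D)]
    ).T


def fit_gaussian(segments, M):
    """Estimate mean and covariance of the stacked (M*D,) resampled vectors
    from a list of training segments for one phoneme."""
    Z = np.stack([resample_segment(X, M).ravel() for X in segments])
    mu = Z.mean(axis=0)
    Sigma = np.cov(Z, rowvar=False) + 1e-6 * np.eye(Z.shape[1])  # regularize
    return mu, Sigma


def log_likelihood(X, mu, Sigma, M):
    """Joint Gaussian log-density of one observed segment under a phoneme model."""
    y = resample_segment(X, M).ravel()
    d = y - mu
    _, logdet = np.linalg.slogdet(Sigma)
    k = y.size
    return -0.5 * (k * np.log(2 * np.pi) + logdet + d @ np.linalg.solve(Sigma, d))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Toy "phoneme" segments: random lengths, 12-dimensional frames.
    train = [rng.normal(size=(rng.integers(5, 20), 12)) for _ in range(50)]
    mu, Sigma = fit_gaussian(train, M=8)
    test = rng.normal(size=(11, 12))
    print(log_likelihood(test, mu, Sigma, M=8))
```

Recognition would then amount to evaluating each candidate phoneme's log-likelihood for a hypothesized segment and picking the best-scoring model, with segmentation handled by the search described in the paper.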

