Abstract
The authors describe an approach, the stochastic segment model, for context modeling in continuous speech recognition for models based on multivariate Gaussian distributions. Typically, robust context models in hidden Markov models (HMMs) are obtained by using mixture distributions; here the authors tie covariance parameters across classes of similar context. The specific classes over which parameters are tied can be based on models with less context or determined by clustering, where they have investigated both hand-specified linguistically motivated clusters and automatic k-means clustering. Experimental results on phoneme classification show that clustering improves performance, and word recognition results show that error reduction over context-independent models using this approach is comparable to that achieved with discrete hidden-Markov models using mixture distributions.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.