Abstract

Word juncture coarticulation is one of the major sources of acoustic variability for initial and final word segments when spoken in fluent speech. One way to improve characterization of word pronuciations in continuous speech is to include inter-word contexts in lexical representations, similar to the way intra-word contexts are utilized. In this paper we investigate the issues related to the modeling of this set of inter-word, context-dependent units using continuous density hidden Markov models. Under such a modeling framework, it is usually required to have enough training tokens available for each unit in order reliably to estimate the parameters of the unit. Therefore, each context-dependent unit is included in the set of units to be modeled only when its frequency of occurrence in the training data exceeds a prescribed threshold. Testing such a unit selection and modeling strategy on the DARPA resource management task, it was found that the incorporation of inter-word units gave a 15–25% word error reduction compared to the base-line continuous speech recognition system using only intra-word units.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.