Abstract
The authors describe the robustness of six types of phoneme-based HMMs (hidden Markov models) against speaking-style variations. The six types of models are VQ (vector quantization)-based and fuzzy VQ-based discrete HMMs, and single-Gaussian and mixture-Gaussian HMMs with either diagonal or full covariance matrices. The mixture-Gaussian HMM with diagonal covariance matrices, the fuzzy VQ-based discrete HMM, and the single-Gaussian HMM with full covariance matrices show better results than the other three in 18-Japanese-consonant recognition experiments. The authors also propose a model-adaptation technique that combines multiple models using the deleted interpolation. This technique makes models easy to apply to different-speaking-style speech. >
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.