Abstract
Two kinds of duration control for HMM (hidden Markov model) phoneme recognition are proposed: phoneme duration control for an HMM phone model and an event duration control for an HMM state. The phoneme duration control is carried out by combining an HMM output probability with a phoneme duration probability. The phoneme duration probability is calculated using a phoneme duration histogram obtained from training samples. Phoneme duration control is effective in discriminating phonemes with different durations such as /n/ and /N/. Event duration control is realized as a state duration penalty calculated from an HMM state duration probability of training samples. Event duration control is effective in discriminating phonemes with different event structures such as /s/ and /ts/. Recognition experiments are carried out using Japanese phonemes extracted from an isolated word database uttered by one male speaker. The phoneme recognition rate is improved from 84.8%–90.0% using these duration control techniques.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.