Abstract

Articulatory features,which represent the articulatory information,can help prosodic features to improve the performance of tone recognition.In this paper,a set of 19 pronunciation categories was given according to the pronunciation characteristics of initials and finals.Besides,19 articulatory tandem features,which are the posteriors of speech signal belonging to the 19 pronunciation categories,were obtained by hierarchical multilayer perceptron classifiers.Then these articulatory tandem features,as well as prosodic features,were used for tone modeling.Tone recognition experiments of three kinds of tone models indicate that about 5% absolute increase of accuracy can be achieved when using both articulatory features and prosodic features.When the proposed tone model is integrated into LVSCR(Large Vocabulary Continuous Speech Recognition) system,the character error rate is reduced significantly.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.