Improved tone modeling by exploiting articulatory features for Mandarin speech recognition

Hao Chao, Wenju Liu, Zhanlei Yang

doi:10.3724/sp.j.1087.2013.02939

Abstract

Articulatory features, which represent articulatory information, can complement prosodic features to improve tone recognition performance. In this paper, a set of 19 pronunciation categories is defined according to the pronunciation characteristics of initials and finals. Nineteen articulatory tandem features, which are the posterior probabilities that the speech signal belongs to each of the 19 pronunciation categories, are then obtained with hierarchical multilayer perceptron classifiers. These articulatory tandem features are used together with prosodic features for tone modeling. Tone recognition experiments with three kinds of tone models indicate that an absolute accuracy gain of about 5% can be achieved when both articulatory and prosodic features are used. When the proposed tone model is integrated into an LVCSR (Large Vocabulary Continuous Speech Recognition) system, the character error rate is reduced significantly.
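As a rough illustration of the tandem idea described in the abstract, the sketch below turns one acoustic frame into 19 posteriors and appends them to a prosodic vector. It is a minimal sketch under stated assumptions, not the authors' implementation: the single-hidden-layer network stands in for the paper's hierarchical multilayer perceptron classifiers, the weights are random rather than trained, and the dimensions, the prosodic values, and all function names are hypothetical.

```python
import math
import random

NUM_CATEGORIES = 19  # number of pronunciation categories stated in the abstract


def softmax(scores):
    """Convert raw scores into probabilities that sum to 1."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def mlp_posteriors(frame, w_hidden, w_out):
    """Single-hidden-layer MLP (assumed stand-in) giving category posteriors."""
    hidden = [math.tanh(sum(w * x for w, x in zip(row, frame)))
              for row in w_hidden]
    scores = [sum(w * h for w, h in zip(row, hidden)) for row in w_out]
    return softmax(scores)


def tone_feature_vector(prosodic, posteriors):
    """Concatenate prosodic features with the articulatory tandem features."""
    return list(prosodic) + list(posteriors)


def random_matrix(rows, cols, rng):
    """Untrained placeholder weights; a real system would learn these."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
ACOUSTIC_DIM, HIDDEN_DIM = 13, 8  # hypothetical sizes, not from the paper
w_hidden = random_matrix(HIDDEN_DIM, ACOUSTIC_DIM, rng)
w_out = random_matrix(NUM_CATEGORIES, HIDDEN_DIM, rng)

frame = [rng.gauss(0.0, 1.0) for _ in range(ACOUSTIC_DIM)]  # one acoustic frame
prosodic = [0.2, -0.1, 0.05, 1.3]  # hypothetical prosodic values

posteriors = mlp_posteriors(frame, w_hidden, w_out)
features = tone_feature_vector(prosodic, posteriors)
print(len(posteriors), len(features))
```

The combined vector has the prosodic dimensions followed by the 19 posteriors, and a tone model would be trained on such vectors in place of prosodic features alone.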
