Abstract

To help second language (L2) learners of Chinese produce tones correctly in computer-assisted language learning (CALL), tone recognition in continuous speech is necessary. Because tones vary in complex ways in continuous speech, this paper proposes a TAM-BLSTM tone recognition model. First, a generative model, the target approximation model (TAM), is used to simulate the fundamental frequency (f0) from the original f0 contour, one prosodic word at a time, and the TAM parameters for each Chinese character are derived. Then a BLSTM model with an attention mechanism (ATT-BLSTM) is built, taking as input the TAM parameters together with basic acoustic features, such as statistical f0 parameters and vowel duration, to detect tones in continuous Mandarin speech. Finally, the trained tone detection model is applied to tone error detection for L2 learners. Experimental results on the Biaobei corpus show that the accuracy of the feature set that includes the TAM parameters is 2.3% higher than that of the basic acoustic features alone, and that the overall accuracy of the ATT-BLSTM network is higher than that of ATT-LSTM.
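The abstract does not state the TAM equations. As a minimal sketch, assuming the model follows the commonly used quantitative target approximation form, f0 within a syllable approaches a linear pitch target m·t + b through a critically damped third-order response with rate λ. The function names, argument names, and example values below are illustrative assumptions, not taken from the paper.

```python
import math

def qta_f0(t, m, b, lam, f0_0, df0_0=0.0, ddf0_0=0.0):
    """f0 at time t (seconds from syllable onset) under an assumed
    quantitative target approximation form.

    m, b   : slope and height of the linear pitch target m*t + b
    lam    : rate at which f0 approaches the target
    f0_0, df0_0, ddf0_0 : f0, its velocity and its acceleration at onset,
                          normally carried over from the previous syllable
    """
    # Coefficients chosen so the curve matches the onset state at t = 0.
    c1 = f0_0 - b
    c2 = df0_0 + c1 * lam - m
    c3 = (ddf0_0 + 2.0 * c2 * lam - c1 * lam ** 2) / 2.0
    transient = (c1 + c2 * t + c3 * t * t) * math.exp(-lam * t)
    return (m * t + b) + transient

def qta_contour(duration, step, m, b, lam, f0_0):
    """Sample the simulated contour every `step` seconds over one syllable."""
    n = int(round(duration / step)) + 1
    return [qta_f0(i * step, m, b, lam, f0_0) for i in range(n)]

# Hypothetical syllable: static target at 100 (e.g. Hz), onset f0 of 120.
contour = qta_contour(duration=0.2, step=0.01, m=0.0, b=100.0, lam=40.0, f0_0=120.0)
```

Under this assumption, the per-character parameters (m, b, λ) found by fitting such a curve to the observed f0 would be the TAM features that are combined with the basic acoustic features.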
