Abstract

Tone recognition is essential in human–computer interaction (HCI) systems for Mandarin learning. This study proposes a new tone recognition method based on random forest (RF) and feature fusion. First, three fusion feature sets (FFSs) were created by applying different fusion methods to sound source features related to Mandarin syllable tone. CART decision trees were then constructed from the three FFSs, and the corresponding RF tone classifiers were modeled and optimized. The method was tested and evaluated on the Syllable Corpus of Standard Chinese (SCSC), a speaker-independent Mandarin monosyllable corpus, and its performance was also assessed on small sample sets. The results show that the algorithm achieves high tone recognition accuracy, generalizes well, and classifies unbalanced data effectively. This indicates that the proposed approach is efficient and robust and is suitable for mobile HCI learning systems.
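
The abstract does not say which features or fusion methods were used, so the following is only a minimal sketch of the general idea (fuse feature groups, then train a forest of CART-style trees), not the authors' implementation. Everything specific in it is an assumption made for illustration: the toy pitch contours are derived from the standard Chao tone numerals (55, 35, 214, 51), the "dynamics" feature group is hypothetical, fusion is plain concatenation, and the forest size and tree depth are arbitrary.

```python
import random
from collections import Counter

def fuse(*feature_groups):
    """Feature-level fusion by concatenation (an assumed, simplest-case fusion method)."""
    fused = []
    for group in feature_groups:
        fused.extend(group)
    return fused

def gini(labels):
    """Gini impurity, the split criterion used by CART classification trees."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(X, y, feature_ids):
    """Return (weighted_gini, feature, threshold) of the best binary split, or None."""
    best = None
    for f in feature_ids:
        for t in sorted({row[f] for row in X}):
            left = [y[i] for i, row in enumerate(X) if row[f] <= t]
            right = [y[i] for i, row in enumerate(X) if row[f] > t]
            if not left or not right:
                continue
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(y)
            if best is None or score < best[0]:
                best = (score, f, t)
    return best

def build_tree(X, y, n_sub, rng, depth=0, max_depth=8):
    """Grow one tree; each node only considers a random subset of n_sub features."""
    if len(set(y)) == 1 or depth == max_depth:
        return Counter(y).most_common(1)[0][0]  # leaf: majority label
    split = best_split(X, y, rng.sample(range(len(X[0])), n_sub))
    if split is None:
        return Counter(y).most_common(1)[0][0]
    _, f, t = split
    li = [i for i, row in enumerate(X) if row[f] <= t]
    ri = [i for i, row in enumerate(X) if row[f] > t]
    return (f, t,
            build_tree([X[i] for i in li], [y[i] for i in li], n_sub, rng, depth + 1, max_depth),
            build_tree([X[i] for i in ri], [y[i] for i in ri], n_sub, rng, depth + 1, max_depth))

def predict_tree(node, x):
    # Internal nodes are tuples; leaves are plain tone labels.
    while isinstance(node, tuple):
        f, t, left, right = node
        node = left if x[f] <= t else right
    return node

def train_forest(X, y, n_trees=25, seed=0):
    """Bagging: every tree is trained on a bootstrap resample of the data."""
    rng = random.Random(seed)
    n_sub = max(1, int(len(X[0]) ** 0.5))
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(X)) for _ in range(len(X))]
        forest.append(build_tree([X[i] for i in idx], [y[i] for i in idx], n_sub, rng))
    return forest

def predict_forest(forest, x):
    """Majority vote over all trees."""
    return Counter(predict_tree(tree, x) for tree in forest).most_common(1)[0][0]

# Toy data (assumption): start/mid/end pitch levels from Chao tone numerals.
CHAO = {1: (5, 5, 5), 2: (3, 4, 5), 3: (2, 1, 4), 4: (5, 3, 1)}

def make_sample(tone, rng, noise=0.2):
    start, mid, end = (p + rng.gauss(0, noise) for p in CHAO[tone])
    contour = [start, mid, end]                        # feature group 1
    dynamics = [end - start, mid - (start + end) / 2]  # feature group 2 (hypothetical)
    return fuse(contour, dynamics)

rng = random.Random(42)
train_y = [tone for tone in CHAO for _ in range(40)]
train_X = [make_sample(t, rng) for t in train_y]
forest = train_forest(train_X, train_y)

test_y = [tone for tone in CHAO for _ in range(10)]
test_X = [make_sample(t, rng) for t in test_y]
accuracy = sum(predict_forest(forest, x) == t for x, t in zip(test_X, test_y)) / len(test_y)
print(f"held-out accuracy on toy data: {accuracy:.2f}")
```

The toy classes are deliberately well separated, so the printed accuracy says nothing about performance on real speech such as SCSC. In practice a library implementation of random forests would normally replace the hand-written trees, and comparing several fusion variants, as the study does with its three FFSs, would mean training one forest per fused feature set.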
