Abstract

Yi is one of the most representative languages in the Yi branch of the Tibeto-Burman language family. To strengthen the protection of Yi, an endangered minority language, this paper studies continuous speech recognition of Yi using several deep learning methods, with the aim of finding the best recognition performance. In the training stage, we first analyzed Yi text to build the language model, and then trained four acoustic models on the audio data: hidden Markov model (HMM), deep neural network (DNN), time-delay neural network (TDNN), and end-to-end. In the recognition stage, the acoustic model is paired with the language model, and the recognized Yi text is obtained through joint decoding that combines the dictionary with the acoustic feature vectors. Comparing the word error rates of the acoustic models, the time-delay neural network performs best on the existing Yi corpus, with a word error rate of only 16.65%.
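
The models above are compared by word error rate (WER). As a minimal sketch of how that metric is usually computed, assuming whitespace-separated tokens and the standard Levenshtein alignment (the function name `word_error_rate` and the sample strings are illustrative and not taken from the paper), WER is the number of substitutions, deletions, and insertions needed to turn the hypothesis into the reference, divided by the number of reference words:

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (substitutions + deletions + insertions) / reference length.

    Assumption: tokens are separated by whitespace. A real Yi pipeline would
    need its own tokenization; this sketch does not attempt that.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one token")

    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis tokens.
    prev = list(range(len(hyp) + 1))
    for i, ref_tok in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_tok in enumerate(hyp, start=1):
            cost = 0 if ref_tok == hyp_tok else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference token missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra token in hypothesis
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Placeholder tokens: one substitution and one deletion over four reference words.
    print(word_error_rate("a b c d", "a x c"))
```

Under this definition, a WER of 16.65% corresponds to roughly one word-level error for every six reference words.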
