Sichuan dialect speech recognition with deep LSTM network

Wangyang Ying,Lei Zhang,Hongli Deng

doi:10.1007/s11704-018-8030-z

Abstract

In speech recognition research, because of the variety of languages, corresponding speech recognition systems need to be constructed for different languages. Especially in a dialect speech recognition system, there are many special words and oral language features. In addition, dialect speech data is very scarce. Therefore, constructing a dialect speech recognition system is difficult. This paper constructs a speech recognition system for Sichuan dialect by combining a hidden Markov model (HMM) and a deep long short-term memory (LSTM) network. Using the HMM-LSTM architecture, we created a Sichuan dialect dataset and implemented a speech recognition system for this dataset. Compared with the deep neural network (DNN), the LSTM network can overcome the problem that the DNN only captures the context of a fixed number of information items. Moreover, to identify polyphone and special pronunciation vocabularies in Sichuan dialect accurately, we collect all the characters in the dataset and their common phoneme sequences to form a lexicon. Finally, this system yields a 11.34% character error rate on the Sichuan dialect evaluation dataset. As far as we know, it is the best performance for this corpus at present.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Sichuan dialect speech recognition with deep LSTM network

Abstract

Talk to us

Similar Papers

More From: Frontiers of Computer Science

Lead the way for us

Journal: Frontiers of Computer Science	Publication Date: Aug 30, 2019
Citations: 29

Similar Papers

Nonlinear unsteady bridge aerodynamics: Reduced-order modeling based on deep LSTM networks
Tao Li ... Zhao Liu
Journal of Wind Engineering and Industrial Aerodynamics | VOL. 198
Tao Li, et. al.Tao Li ... Zhao Liu
08 Feb 2020
Journal of Wind Engineering and Industrial Aerodynamics | VOL. 198

Distilling the Knowledge From Handcrafted Features for Human Activity Recognition
Zhenghua Chen ... Zhiguang Cao
IEEE Transactions on Industrial Informatics | VOL. 14
Zhenghua Chen, et. al.Zhenghua Chen ... Zhiguang Cao
01 Oct 2018
IEEE Transactions on Industrial Informatics | VOL. 14

Anomaly Detection with Deep Long Short Term Memory Networks
Merve Begum Terzi
-
Merve Begum TerziMerve Begum Terzi
15 Sep 2021
15 Sep 2021

A Deep Bidirectional Highway Long Short-Term Memory Network Approach to Chinese Semantic Role Labeling
Qi Xia ... Chung-Hsing Yeh
-
Qi Xia, et. al.Qi Xia ... Chung-Hsing Yeh
01 Jul 2019
01 Jul 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Sichuan dialect speech recognition with deep LSTM network

Abstract

Talk to us

Similar Papers

More From: Frontiers of Computer Science