Research on Mongolian acoustic model based on BLSTM-CTC for Inner Mongolia Electric Power

Tuya Li, Yiming Zhao, Shasha Su, Yaoting Han, Shan Li, Xiaoyu Chen

doi:10.1109/cniot55862.2022.00012

Abstract

The intelligent voice customer service of Inner Mongolia Electric Power serves a large number of Mongolian speakers. Its Mongolian speech recognition mainly works in a Q&A mode, in which human-machine dialogue is carried out in whole sentences. However, because spoken sentences differ in length, training a Mongolian acoustic model based on the deep neural network-hidden Markov model (DNN-HMM) mainly uses fragment-level information from the speech and ignores the integrity of the sentence. To address this, this paper proposes a Mongolian acoustic model based on Bi-directional Long Short-Term Memory-Connectionist Temporal Classification (BLSTM-CTC), which unifies the length of input sentences and models complete sentences by inserting BLANK features and labels. In a comparison experiment between BLSTM-CTC and DNN-HMM, speech recognition based on BLSTM-CTC shows a lower word error rate and sentence error rate, reduced by 3.57% and 4.09% respectively, with the larger improvement in the latter. This indicates that the modeling ability of BLSTM-CTC, especially for sentences, is clearly higher than that of DNN-HMM.
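
As an illustration of the BLANK label mentioned in the abstract, the following is a minimal sketch of the standard CTC convention: blanks are interleaved with the target labels, and a frame-level output path is mapped back to a label sequence by merging consecutive repeats and then dropping blanks. It is not taken from the paper; the blank symbol `<b>`, the function names, and the toy labels are all assumptions made for illustration, and the paper's own treatment of BLANK features may differ.

```python
from itertools import groupby

BLANK = "<b>"  # hypothetical blank symbol, chosen only for this sketch


def extend_with_blanks(labels):
    """Interleave blanks with the labels: [a, b] -> [BLANK, a, BLANK, b, BLANK]."""
    extended = [BLANK]
    for label in labels:
        extended.extend([label, BLANK])
    return extended


def ctc_collapse(path):
    """Map a frame-level path to labels: merge consecutive repeats, then drop blanks."""
    merged = [symbol for symbol, _ in groupby(path)]
    return [symbol for symbol in merged if symbol != BLANK]


# Toy frame-level path; the blank between the two "a" frames keeps them distinct.
frame_path = [BLANK, "s", "s", BLANK, "a", "a", BLANK, "a", "n", BLANK]
print(ctc_collapse(frame_path))
```

Because many frame-level paths of different lengths collapse to the same label sequence, a sentence of any duration can be trained against its full transcript without a frame-by-frame alignment.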
