Abstract

Speech recognition technology plays an indispensable role in realizing intelligent human-computer interaction. However, most current Chinese speech recognition systems are offered only as online services or as offline models with low accuracy and poor performance. To improve the performance of offline Chinese speech recognition, we propose a hybrid acoustic model combining a deep convolutional neural network, long short-term memory, and a deep neural network (DCNN-LSTM-DNN, DLD). The model uses the DCNN to reduce frequency variation, with a batch normalization (BN) layer after each convolutional layer to stabilize the data distribution, and then applies LSTM layers to alleviate the vanishing gradient problem. Finally, the fully connected layers of the DNN map the learned features into a more separable space, which aids classification. Combining the strengths of the DCNN, LSTM, and DNN in a unified architecture therefore effectively improves speech recognition performance. Our model was tested on the open Chinese speech database THCHS-30, released by the Center for Speech and Language Technology (CSLT) of Tsinghua University; the DLD model with three LSTM layers and three DNN layers performed best, achieving a word error rate (WER) of 13.49%.
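
To make the layered structure concrete, the following is a minimal sketch of a DCNN-LSTM-DNN acoustic model in PyTorch. The class name `DLDModel`, the input feature dimension `n_mels`, the output size `n_classes`, the layer widths, and the pooling scheme are all illustrative assumptions, not the paper's reported configuration; only the overall layout (convolution with BN, three stacked LSTM layers, three fully connected layers) follows the abstract.

```python
import torch
import torch.nn as nn

class DLDModel(nn.Module):
    """Hypothetical sketch of a DCNN-LSTM-DNN (DLD) acoustic model:
    a convolutional front end with batch normalization, followed by
    three stacked LSTM layers and three fully connected (DNN) layers.
    Layer sizes are assumptions for illustration only."""

    def __init__(self, n_mels=40, n_classes=218, lstm_hidden=256, dnn_hidden=512):
        super().__init__()
        # DCNN front end: reduces frequency variation; BN after each conv layer
        self.cnn = nn.Sequential(
            nn.Conv2d(1, 32, kernel_size=3, padding=1),
            nn.BatchNorm2d(32),
            nn.ReLU(),
            nn.MaxPool2d(kernel_size=(1, 2)),   # pool along the frequency axis only
            nn.Conv2d(32, 32, kernel_size=3, padding=1),
            nn.BatchNorm2d(32),
            nn.ReLU(),
            nn.MaxPool2d(kernel_size=(1, 2)),
        )
        cnn_out_dim = 32 * (n_mels // 4)        # channels x pooled frequency bins
        # Three stacked LSTM layers to model temporal context
        self.lstm = nn.LSTM(cnn_out_dim, lstm_hidden, num_layers=3, batch_first=True)
        # Three fully connected (DNN) layers mapping to output classes
        self.dnn = nn.Sequential(
            nn.Linear(lstm_hidden, dnn_hidden), nn.ReLU(),
            nn.Linear(dnn_hidden, dnn_hidden), nn.ReLU(),
            nn.Linear(dnn_hidden, n_classes),
        )

    def forward(self, x):
        # x: (batch, time, n_mels) acoustic features, e.g. log-mel filterbanks
        x = x.unsqueeze(1)                      # -> (batch, 1, time, n_mels)
        x = self.cnn(x)                         # -> (batch, 32, time, n_mels // 4)
        b, c, t, f = x.shape
        x = x.permute(0, 2, 1, 3).reshape(b, t, c * f)  # -> (batch, time, cnn_out_dim)
        x, _ = self.lstm(x)                     # -> (batch, time, lstm_hidden)
        return self.dnn(x)                      # per-frame class logits

# Usage example: a batch of 8 utterances, 200 frames, 40 mel bins
model = DLDModel()
logits = model(torch.randn(8, 200, 40))        # -> shape (8, 200, 218)
```

In this sketch, pooling is applied only along the frequency axis so that the frame rate is preserved for frame-level classification; whether the original model pools in time as well is not stated in the abstract.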
