Abstract
In Automatic Speech Recognition (ASR), the Time Delay Neural Network (TDNN) has proven to be an efficient network structure owing to its strong context-modeling ability. In addition, as a feed-forward architecture, a TDNN is faster to train than recurrent neural networks such as the Long Short-Term Memory (LSTM) network. However, unlike recurrent networks, the temporal context of a TDNN is hand-designed and therefore limited. Although stacking LSTM layers together with TDNN layers to extend the context has proven useful, the resulting model is complex and hard to train. In this paper, we focus on directly extending the context-modeling capability of TDNNs by adding recurrent connections. Several new network architectures were investigated. Results on the Switchboard corpus show that the best model significantly outperforms the baseline TDNN system and is comparable with the TDNN-LSTM architecture. In addition, its training process is much simpler than that of TDNN-LSTM.
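To make the core idea concrete, below is a minimal PyTorch sketch of one possible way to add a recurrent connection to a TDNN layer. It is an illustrative assumption, not the exact architecture evaluated in the paper: the class name, dimensions, and the way the previous output frame is fed back are all hypothetical. The TDNN part is modeled as a dilated 1-D convolution over time (the standard view of a TDNN layer), and the recurrence lets the output at frame t also depend on the layer's own previous output, extending the effective context beyond the fixed convolutional window.

```python
import torch
import torch.nn as nn

class RecurrentTDNNLayer(nn.Module):
    """Sketch of a TDNN layer augmented with a simple recurrent connection.

    Hypothetical illustration of the paper's general idea, not its exact model:
    a dilated 1-D convolution supplies the fixed, hand-designed temporal
    context of a TDNN, and a linear feedback of the previous output frame
    extends that context recurrently.
    """

    def __init__(self, in_dim: int, out_dim: int, dilation: int = 1):
        super().__init__()
        # TDNN layer == 1-D convolution over the time axis with a fixed
        # 3-frame context; padding keeps the output length equal to the input.
        self.conv = nn.Conv1d(in_dim, out_dim, kernel_size=3,
                              dilation=dilation, padding=dilation)
        # Recurrent connection: mixes the previous output frame back in.
        self.rec = nn.Linear(out_dim, out_dim, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, in_dim, time)
        feedforward = self.conv(x)                    # (batch, out_dim, time)
        h = torch.zeros(x.size(0), feedforward.size(1), device=x.device)
        outputs = []
        for t in range(feedforward.size(2)):
            # Output at frame t = conv context + feedback of previous output.
            h = torch.relu(feedforward[:, :, t] + self.rec(h))
            outputs.append(h)
        return torch.stack(outputs, dim=2)            # (batch, out_dim, time)

# Usage: 40-dim input features, 256-dim output, a 100-frame utterance.
layer = RecurrentTDNNLayer(in_dim=40, out_dim=256)
y = layer(torch.randn(8, 40, 100))                    # -> (8, 256, 100)
```

Compared with stacking full LSTM layers on top of TDNN layers, a feedback connection of this kind adds only a single linear map per layer, which is consistent with the abstract's claim that such models are simpler to train than TDNN-LSTM hybrids.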