Maxout neurons based deep bidirectional LSTM for acoustic modeling

Yuan Luo, Zhou Ye, Boyu Wang, Yi Zhang, Yu Liu

doi:10.1109/robio.2017.8324646

Abstract

Recently, long short-term memory (LSTM) recurrent neural networks (RNNs) have achieved considerable success as acoustic models in large vocabulary continuous speech recognition systems. In this paper, we propose an improved hybrid acoustic model based on a deep bidirectional long short-term memory (DBLSTM) RNN. In this model, maxout neurons are used in the fully connected part of the DBLSTM to address the vanishing and exploding gradient problems. The dropout regularization algorithm is also applied to reduce overfitting during training. In addition, to accommodate the bidirectional dependence of the DBLSTM at each time step, a context-sensitive-chunk (CSC) back-propagation through time (BPTT) algorithm is proposed to train the DBLSTM network. Experiments were conducted on the Switchboard benchmark task. The results show that the improved hybrid acoustic model reaches a word error rate (WER) of 14.5%, and the best-performing network structures and CSC configurations are reported.
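The two mechanisms named in the abstract can be illustrated with a minimal sketch. The abstract gives no implementation details, so everything below is an assumption: `maxout` is taken to mean the usual definition (the maximum over a group of linear pre-activations), and a context-sensitive chunk is assumed to be a fixed-length run of frames padded with left and right context frames that supply context but are excluded from the loss. The function names and the parameters `group_size`, `chunk`, `left`, and `right` are hypothetical and do not come from the paper.

```python
from typing import List, Tuple


def maxout(pre_activations: List[float], group_size: int) -> List[float]:
    """Assumed maxout unit: split the linear pre-activations into
    consecutive groups of `group_size` and keep the maximum of each group."""
    if group_size <= 0 or len(pre_activations) % group_size != 0:
        raise ValueError("length must be a positive multiple of group_size")
    return [
        max(pre_activations[i:i + group_size])
        for i in range(0, len(pre_activations), group_size)
    ]


def csc_chunks(
    num_frames: int, chunk: int, left: int, right: int
) -> List[Tuple[int, int, int, int]]:
    """Assumed CSC splitting of an utterance with `num_frames` frames.

    Returns (ctx_start, core_start, core_end, ctx_end) index tuples,
    all end-exclusive. Frames in [core_start, core_end) would contribute
    to the loss; frames in [ctx_start, core_start) and [core_end, ctx_end)
    only provide left and right context for the bidirectional layers.
    """
    if chunk <= 0 or left < 0 or right < 0:
        raise ValueError("chunk must be positive; contexts must be non-negative")
    chunks = []
    for core_start in range(0, num_frames, chunk):
        core_end = min(core_start + chunk, num_frames)
        ctx_start = max(0, core_start - left)      # clip at utterance start
        ctx_end = min(num_frames, core_end + right)  # clip at utterance end
        chunks.append((ctx_start, core_start, core_end, ctx_end))
    return chunks


if __name__ == "__main__":
    # Four pre-activations reduced to two maxout outputs.
    print(maxout([1.0, 3.0, -2.0, 0.5], group_size=2))
    # A 10-frame utterance, 4-frame chunks, 2 left and 1 right context frames.
    print(csc_chunks(10, chunk=4, left=2, right=1))
```

Training on such chunks instead of whole utterances would bound how far BPTT has to unroll in both directions; the chunk and context lengths actually used are the CSC configurations the paper reports, which are not stated in this abstract.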

Similar Papers

Training Deep Bidirectional LSTM Acoustic Model for LVCSR by a Context-Sensitive-Chunk BPTT Approach
Kai Chen ... Qiang Huo
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 24
01 Jul 2016

Grapheme-to-phoneme conversion using Long Short-Term Memory recurrent neural networks
Kanishka Rao ... Hasim Sak
01 Apr 2015

ECG Beat Classification Based on Deep Bidirectional Long Short-Term Memory Recurrent Neural Network
Runchuan Li ... Gang Chen
01 Jan 2019