An improved residual LSTM architecture for acoustic modeling

Lu Huang,Yi Yang,Jiasong Sun,Ji Xu

doi:10.1109/ccoms.2017.8075276

An improved residual LSTM architecture for acoustic modeling

Lu Huang, Yi Yang + Show 2 more

Open Access

https://doi.org/10.1109/ccoms.2017.8075276

Copy DOI

Publication Date: Jul 1, 2017

Citations: 23

Affiliation: Tsinghua University

#Residual Long Short-Term Memory #Phone Error Rate + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Long Short-Term Memory (LSTM) is the primary recurrent neural networks architecture for acoustic modeling in automatic speech recognition systems. Residual learning is an efficient method to help neural networks converge easier and faster. In this paper, we propose several types of residual LSTM methods for our acoustic modeling. Our experiments indicate that, compared with classic LSTM, our architecture shows more than 8% relative reduction in Phone Error Rate (PER) on TIMIT tasks. At the same time, our residual fast LSTM approach shows 4% relative reduction in PER on the same task. Besides, we find that all this architecture could have good results on THCHS-30, Librispeech and Switchboard corpora.

Full Text