Framewise phoneme classification with bidirectional LSTM networks

A Graves, J Schmidhuber

doi:10.1109/ijcnn.2005.1556215

Open Access

Publication Date: Dec 27, 2005
Citations: 300	License type: other-oa

Affiliation: Dalle Molle Institute for Artificial Intelligence Research

#Long Short Term Memory #Unidirectional Long Short Term Memory

Abstract

In this paper, we apply bidirectional training to a long short term memory (LSTM) network for the first time. We also present a modified, full gradient version of the LSTM learning algorithm. We discuss the significance of framewise phoneme classification to continuous speech recognition, and the validity of using bidirectional networks for online causal tasks. On the TIMIT speech database, we measure the framewise phoneme classification scores of bidirectional and unidirectional variants of both LSTM and conventional recurrent neural networks (RNNs). We find that bidirectional LSTM outperforms both RNNs and unidirectional LSTM.

Full Text