Framewise phoneme classification with bidirectional LSTM and other neural network architectures

Alex Graves,Jürgen Schmidhuber

doi:10.1016/j.neunet.2005.06.042

Abstract

In this paper, we present bidirectional Long Short Term Memory (LSTM) networks, and a modified, full gradient version of the LSTM learning algorithm. We evaluate Bidirectional LSTM (BLSTM) and several other network architectures on the benchmark task of framewise phoneme classification, using the TIMIT database. Our main findings are that bidirectional networks outperform unidirectional ones, and Long Short Term Memory (LSTM) is much faster and also more accurate than both standard Recurrent Neural Nets (RNNs) and time-windowed Multilayer Perceptrons (MLPs). Our results support the view that contextual information is crucial to speech processing, and suggest that BLSTM is an effective architecture with which to exploit it. 1 1 An abbreviated version of some portions of this article appeared in ( Graves and Schmidhuber, 2005), as part of the IJCNN 2005 conference proceedings, published under the IEEE copyright.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Framewise phoneme classification with bidirectional LSTM and other neural network architectures

Abstract

Talk to us

Similar Papers

More From: Neural Networks

Lead the way for us

Journal: Neural Networks	Publication Date: Jul 1, 2005
Citations: 3637

Similar Papers

Enhancing Electrical Load Prediction Using a Bidirectional LSTM Neural Network
Christos Pavlatos ... Valeri Mladenov
Electronics | VOL. 12
Christos Pavlatos, et. al.Christos Pavlatos ... Valeri Mladenov
15 Nov 2023
Electronics | VOL. 12

An ensemble method to forecast 24-h ahead solar irradiance using wavelet decomposition and BiLSTM deep learning network.
Pardeep Singla ... Manoj Duhan
Earth Science Informatics | VOL. 15
Pardeep Singla, et. al.Pardeep Singla ... Manoj Duhan
17 Nov 2021
Earth Science Informatics | VOL. 15

Enhancing source code retrieval with joint Bi-LSTM-GNN architecture: A comparative study with ChatGPT-LLM
Nazia Bibi ... Tauseef Rana
Journal of King Saud University - Computer and Information Sciences | VOL. 36
Nazia Bibi, et. al.Nazia Bibi ... Tauseef Rana
14 Dec 2023
Journal of King Saud University - Computer and Information Sciences | VOL. 36

End-to-End Model Based on Bidirectional LSTM and CTC for Online Handwritten Mongolian Word Recognition
Da Teng ... Fengshan Bai
-
Da Teng, et. al.Da Teng ... Fengshan Bai
14 Oct 2022
14 Oct 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Framewise phoneme classification with bidirectional LSTM and other neural network architectures

Abstract

Talk to us

Similar Papers

More From: Neural Networks