Approximate Computing for Long Short Term Memory (LSTM) Neural Networks

Sanchari Sen,Anand Raghunathan

doi:10.1109/tcad.2018.2858362

Abstract

Long Short Term Memory (LSTM) networks are a class of recurrent neural networks that are widely used for machine learning tasks involving sequences, including machine translation, text generation, and speech recognition. Large-scale LSTMs, which are deployed in many real-world applications, are highly compute intensive. To address this challenge, we propose AxLSTM, an application of approximate computing to improve the execution efficiency of LSTMs. An LSTM is composed of cells, each of which contains a cell state along with multiple gating units that control the addition and removal of information from the state. The LSTM execution proceeds in timesteps, with a new symbol of the input sequence processed at each timestep. AxLSTM consists of two techniques—Dynamic Timestep Skipping (DTS) and Dynamic State Reduction (DSR). DTS identifies, at runtime, input symbols that are likely to have little or no impact on the cell state and skips evaluating the corresponding timesteps. In contrast, DSR reduces the size of the cell state in accordance with the complexity of the input sequence, leading to a reduced number of computations per timestep. We describe how AxLSTM can be applied to the most common application of LSTMs, viz. , sequence-to-sequence learning. We implement AxLSTM within the TensorFlow deep learning framework and evaluate it on 3 state-of-the-art sequence-to-sequence models. On a 2.7 GHz Intel Xeon server with 128 GB memory and 32 processor cores, AxLSTM achieves $ {1.08\times -1.31 \times }$ speedups with minimal loss in quality, and $ {1.12 \times -1.37 \times }$ speedups when moderate reductions in quality are acceptable.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems	Publication Date: Nov 1, 2018
Citations: 57	License type: publisher-specific, author manuscript

R Discovery Prime

R Discovery Prime

Approximate Computing for Long Short Term Memory (LSTM) Neural Networks

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

Lead the way for us

Similar Papers

Using long short term memory and convolutional neural networks for driver drowsiness detection
Azhar Quddus ... Felix J.E Comeau
Accident Analysis & Prevention | VOL. 156
Azhar Quddus, et. al.Azhar Quddus ... Felix J.E Comeau
10 Apr 2021
Accident Analysis & Prevention | VOL. 156

Share Market Prediction Using Long Short Term Memory and Artificial Neural Network
J.Aruna Jasmine ... T.P Rani
-
J.Aruna Jasmine, et. al.J.Aruna Jasmine ... T.P Rani
16 Dec 2021
16 Dec 2021

Forex market forecasting with two-layer stacked Long Short-Term Memory neural network (LSTM) and correlation analysis
Michael Ayitey Junior ... Peter Appiahene
Journal of Electrical Systems and Information Technology | VOL. 9
Michael Ayitey Junior, et. al.Michael Ayitey Junior ... Peter Appiahene
30 Jun 2022
Journal of Electrical Systems and Information Technology | VOL. 9

Modeling plasticity during epileptogenesis by long short term memory neural networks.
Marzieh Shahpari ... Javad Mirnajafi-Zadeh
Cognitive Neurodynamics | VOL. 16
Marzieh Shahpari, et. al.Marzieh Shahpari ... Javad Mirnajafi-Zadeh
15 Sep 2021
Cognitive Neurodynamics | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Approximate Computing for Long Short Term Memory (LSTM) Neural Networks

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems