Boosting LSTM Performance Through Dynamic Precision Selection

Franyell Silfa, Antonio Gonzalez, Jose Maria Arnau

doi:10.1109/hipc50609.2020.00046

Open Access

https://doi.org/10.1109/hipc50609.2020.00046

Publication Date: Dec 1, 2020
Citations: 3
License type: public-domain

Affiliation: Universitat Politècnica de Catalunya

Abstract

The use of low numerical precision is a fundamental optimization in modern accelerators for Deep Neural Networks (DNNs). The number of bits in the numerical representation is set to the minimum precision that retains accuracy, as determined by offline profiling, and it is kept constant during DNN inference. In this work, we explore the use of dynamic precision selection during DNN inference. We focus on Long Short-Term Memory (LSTM) networks, which are the state-of-the-art networks for applications such as machine translation and speech recognition. Unlike conventional DNNs, LSTM networks remember information from previous evaluations by storing data in the LSTM cell state. Our key observation is that the cell state determines the amount of precision required: time-steps where the cell state changes significantly require higher precision, whereas time-steps where the cell state is stable can be computed with lower precision without any loss in accuracy. We propose a novel hardware scheme that tracks the evolution of the elements in the LSTM cell state and dynamically selects the appropriate precision on each time-step. For a set of popular LSTM networks, it chooses the lowest precision 57% of the time, outperforming systems that fix the precision statically. We evaluate our proposal on top of a modern, highly optimized LSTM accelerator and show that it provides a 1.46x speedup and 19.2% energy savings on average without degrading model accuracy. Our scheme has an overhead of less than 8%.
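The key observation in the abstract, that a stable cell state tolerates lower precision, can be illustrated with a small software sketch. This is not the hardware scheme from the paper: the change metric (`relative_change`), the threshold, and the two bit widths are all hypothetical choices made for illustration, and the function names are invented here.

```python
# Illustrative sketch only. The metric, threshold, and bit widths are
# assumptions for this example, not values or logic taken from the paper.
from typing import List, Sequence

LOW_BITS = 4      # hypothetical low-precision width
HIGH_BITS = 8     # hypothetical high-precision width
THRESHOLD = 0.1   # hypothetical relative-change threshold


def relative_change(prev: Sequence[float], curr: Sequence[float]) -> float:
    """Sum of absolute element changes, normalized by the magnitude of prev."""
    diff = sum(abs(c - p) for p, c in zip(prev, curr))
    scale = sum(abs(p) for p in prev)
    if scale == 0.0:
        # No reference magnitude: any change at all counts as significant.
        return 0.0 if diff == 0.0 else float("inf")
    return diff / scale


def select_precision(prev: Sequence[float], curr: Sequence[float],
                     threshold: float = THRESHOLD) -> int:
    """Return LOW_BITS when the cell state is stable, HIGH_BITS otherwise."""
    return LOW_BITS if relative_change(prev, curr) < threshold else HIGH_BITS


def precision_schedule(cell_states: Sequence[Sequence[float]],
                       threshold: float = THRESHOLD) -> List[int]:
    """Label each time-step with a bit width based on the change from the
    previous cell state. The first step has no history, so it uses HIGH_BITS."""
    schedule = [HIGH_BITS] if cell_states else []
    for prev, curr in zip(cell_states, cell_states[1:]):
        schedule.append(select_precision(prev, curr, threshold))
    return schedule
```

For example, `precision_schedule([[1.0, -1.0], [1.01, -1.0], [2.0, 0.5], [2.0, 0.5]])` labels the second and fourth steps as low precision, because the cell state barely moves there, and the third step as high precision. A real implementation would have to decide the precision before computing the step, which this retrospective labelling does not attempt to model.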
