Stochastic Computing Architectures for Lightweight LSTM Neural Networks

Roshwin Sengupta,John P Hayes,Ilia Polian

doi:10.1109/ddecs54261.2022.9770167

Abstract

For emerging edge and near-sensor systems to perform hard classification tasks locally, they must avoid costly communication with the cloud. This requires the use of compact classifiers such as recurrent neural networks of the long short term memory (LSTM) type, as well as a low-area hardware technology such as stochastic computing (SC). We study the benefits and costs of applying SC to LSTM design. We consider a design space spanned by fully binary (non-stochastic), fully stochastic, and several hybrid (mixed) LSTM architectures, and design and simulate examples of each. Using standard classification benchmarks, we show that area and power can be reduced up to 47% and 86% respectively with little or no impact on classification accuracy. We demonstrate that fully stochastic LSTMs can deliver acceptable accuracy despite accumulated errors. Our results also suggest that ReLU is preferable to tanh as an activation function in stochastic LSTMs

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Stochastic Computing Architectures for Lightweight LSTM Neural Networks

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Analyzing the performance of long short‐term memory architectures for malware detection models
Cigdem Avci ... Cagatay Catal
Concurrency and Computation: Practice and Experience | VOL. 35
Cigdem Avci, et. al.Cigdem Avci ... Cagatay Catal
27 Jan 2023
Concurrency and Computation: Practice and Experience | VOL. 35

Framewise phoneme classification with bidirectional LSTM and other neural network architectures
Alex Graves ... Jürgen Schmidhuber
Neural Networks | VOL. 18
Alex Graves, et. al.Alex Graves ... Jürgen Schmidhuber
01 Jul 2005
Neural Networks | VOL. 18

Imbalanced Learning of Regular Grammar for DFA Extraction from LSTM Architecture
Anish Sharma ... Rajeev Kumar
-
Anish Sharma, et. al.Anish Sharma ... Rajeev Kumar
01 Jan 2023
01 Jan 2023

Investigation of Load, Solar and Wind Generation as Target Variables in LSTM Time Series Forecasting, Using Exogenous Weather Variables
Thomas Shering ... Dimitra Apostolopoulou
Energies | VOL. 17
Thomas Shering, et. al.Thomas Shering ... Dimitra Apostolopoulou
11 Apr 2024
Energies | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Stochastic Computing Architectures for Lightweight LSTM Neural Networks

Abstract

Talk to us

Similar Papers