Abstract

Long Short-Term Memory (LSTM) networks and their variants have been widely adopted in many sequential learning tasks, such as speech recognition and machine translation. Significant accuracy improvements can be achieved with complex LSTM models, but these carry a large memory requirement and high computational complexity, making inference time-consuming and energy-demanding. The low-latency and energy-efficiency requirements of real-world applications make model compression and hardware acceleration for LSTM an urgent need. In this paper, several hardware-efficient network compression schemes are first introduced, including structured top-$k$ pruning, clipped gating, and multiplication-free quantization, which reduce the model size and the number of matrix operations by 32$\times$ and 21.6$\times$, respectively, with negligible accuracy loss. Furthermore, efficient hardware architectures for accelerating the compressed LSTM are proposed, supporting inference over multiple layers and multiple time steps. The computation process is judiciously reorganized and the memory access pattern is carefully optimized, which alleviates the memory-bandwidth bottleneck and enables higher throughput. Moreover, the parallel processing strategy is designed to fully exploit the sparsity introduced by pruning and clipped gating while maintaining high hardware utilization efficiency. Implemented on an Intel Arria 10 SX660 FPGA running at 200 MHz, the proposed design achieves 1.4–2.2$\times$ higher energy efficiency and requires significantly fewer hardware resources than state-of-the-art LSTM implementations.
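To make two of the named compression schemes concrete, the following NumPy sketch illustrates structured top-$k$ pruning (each weight-matrix row keeps exactly $k$ nonzeros, which keeps parallel processing elements load-balanced) and a power-of-two quantization in the spirit of the multiplication-free scheme (each surviving weight is snapped to a signed power of two, so multiplies become bit shifts). The function names, the per-row pruning granularity, and the 4-bit exponent range are illustrative assumptions, not the paper's exact formulation.

```python
import numpy as np

def structured_topk_prune(W: np.ndarray, k: int) -> np.ndarray:
    """Keep only the k largest-magnitude weights in each row of W.

    Assumed sketch of structured top-k pruning: per-row top-k keeps the
    nonzero count identical across rows, balancing the hardware workload.
    """
    pruned = np.zeros_like(W)
    # Column indices of the k largest |w| in every row.
    keep = np.argpartition(np.abs(W), -k, axis=1)[:, -k:]
    rows = np.arange(W.shape[0])[:, None]
    pruned[rows, keep] = W[rows, keep]
    return pruned

def pow2_quantize(W: np.ndarray, bits: int = 4) -> np.ndarray:
    """Snap each nonzero weight to a signed power of two by rounding its
    base-2 exponent, so every multiply reduces to a bit shift.

    Assumed sketch of multiplication-free quantization; the exponent range
    [-2**(bits-1), 0] is a hypothetical choice.
    """
    sign = np.sign(W)                                  # zeros stay zero
    exp = np.round(np.log2(np.abs(W) + 1e-12))         # nearest exponent
    exp = np.clip(exp, -(2 ** (bits - 1)), 0)          # representable range
    return sign * np.exp2(exp)

# Example: compress an 8x8 LSTM gate matrix to 2 nonzeros per row
# (75% sparsity), then quantize the survivors to powers of two.
rng = np.random.default_rng(0)
W = rng.standard_normal((8, 8)) * 0.5
W_c = pow2_quantize(structured_topk_prune(W, k=2))
assert (W_c != 0).sum(axis=1).max() <= 2
```

In a design of this kind, the balanced per-row sparsity is what lets the accelerator's parallel lanes stay fully utilized, and shift-only arithmetic is what removes the DSP multipliers from the datapath; both properties follow directly from the two transformations sketched above.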
