EdgeDRNN: Recurrent Neural Network Accelerator for Edge Inference

Chang Gao,Xi Chen,Shih-Chii Liu,Antonio Rios-Navarro,Tobi Delbruck

doi:10.1109/jetcas.2020.3040300

Abstract

Low-latency, low-power portable recurrent neural network (RNN) accelerators offer powerful inference capabilities for real-time applications such as IoT, robotics, and human-machine interaction. We propose a lightweight Gated Recurrent Unit (GRU)-based RNN accelerator called EdgeDRNN that is optimized for low-latency edge RNN inference with batch size of 1. EdgeDRNN adopts the spiking neural network inspired delta network algorithm to exploit temporal sparsity in RNNs. Weights are stored in inexpensive DRAM which enables EdgeDRNN to compute large multi-layer RNNs on the most inexpensive FPGA. The sparse updates reduce DRAM weight memory access by a factor of up to 10x and the delta can be varied dynamically to trade-off between latency and accuracy. EdgeDRNN updates a 5 million parameter 2-layer GRU-RNN in about 0.5ms. It achieves latency comparable with a 92W Nvidia 1080 GPU. It outperforms NVIDIA Jetson Nano, Jetson TX2 and Intel Neural Compute Stick 2 in latency by 5X. For a batch size of 1, EdgeDRNN achieves a mean effective throughput of 20.2GOp/s and a wall plug power efficiency that is over 4X higher than the commercial edge AI platforms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Journal on Emerging and Selected Topics in Circuits and Systems	Publication Date: Dec 1, 2020
Citations: 56	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

EdgeDRNN: Recurrent Neural Network Accelerator for Edge Inference

Abstract

Talk to us

Similar Papers

More From: IEEE Journal on Emerging and Selected Topics in Circuits and Systems

Lead the way for us

Similar Papers

EdgeDRNN: Enabling Low-latency Recurrent Neural Network Edge Inference
Chang Gao ... Antonio Rios-Navarro
-
Chang Gao, et. al.Chang Gao ... Antonio Rios-Navarro
01 Aug 2020
01 Aug 2020

Editor's evaluation: Neural population dynamics of computing with synaptic modulations
Gianluigi Mongillo
-
Gianluigi MongilloGianluigi Mongillo
08 Jan 2023
08 Jan 2023

Author response: Neural population dynamics of computing with synaptic modulations
Kyle Aitken ... Stefan Mihalas
-
Kyle Aitken, et. al.Kyle Aitken ... Stefan Mihalas
10 Feb 2023
10 Feb 2023

Decision letter: Neural population dynamics of computing with synaptic modulations
Omri Barak ... Joshua I Gold
-
Omri Barak, et. al.Omri Barak ... Joshua I Gold
08 Jan 2023
08 Jan 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

EdgeDRNN: Recurrent Neural Network Accelerator for Edge Inference

Abstract

Talk to us

Similar Papers

More From: IEEE Journal on Emerging and Selected Topics in Circuits and Systems