Abstract

This study presents AERO, a resource-efficient reconfigurable inference processor for recurrent neural networks (RNN). AERO is programmable to perform inference on RNN models of various types. It was designed based on an instruction-set architecture specialized for processing the primitive vector operations that compose the dataflows of RNN models. A versatile vector-processing unit (VPU) was incorporated to perform every vector operation and achieve high resource efficiency. Aiming at low resource usage, the multiplication in the VPU is carried out on the basis of an approximation scheme, and the activation functions are realized with reduced tables. We developed a prototype inference system based on AERO using a resource-limited field-programmable gate array, under which the functionality of AERO was verified extensively for inference tasks based on several RNN models of different types. The resource efficiency of AERO was found to be as high as 1.28 MOP/s/LUT, 1.3 times higher than the previous state-of-the-art result.
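The abstract does not detail how the reduced activation tables work. A common realization of this idea, sketched here purely as an illustration (the constants `TABLE_BITS` and `X_MAX` are assumptions, not values from the paper), samples the activation function at a coarse grid and reconstructs intermediate points by linear interpolation, exploiting odd symmetry and saturation to keep the table small:

```python
import math

# Illustrative reduced-table tanh (not the paper's exact scheme).
# tanh is sampled at a coarse grid; intermediate points are reconstructed
# by linear interpolation between adjacent table entries.
TABLE_BITS = 6                      # assumed: 64 intervals instead of a fine-grained table
X_MAX = 4.0                         # tanh is effectively saturated beyond |x| ~ 4
STEP = X_MAX / (1 << TABLE_BITS)
TABLE = [math.tanh(i * STEP) for i in range((1 << TABLE_BITS) + 1)]

def tanh_reduced(x: float) -> float:
    """Approximate tanh(x) from the reduced table with linear interpolation."""
    sign = -1.0 if x < 0 else 1.0   # odd symmetry: store only x >= 0
    x = abs(x)
    if x >= X_MAX:
        return sign * 1.0           # saturation region needs no table entry
    idx = int(x / STEP)
    frac = x / STEP - idx
    return sign * (TABLE[idx] + frac * (TABLE[idx + 1] - TABLE[idx]))
```

With 64 intervals the interpolation error stays well below 1%, which is typically tolerable for RNN inference while cutting the table memory sharply.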

Highlights

  • This study presents a resource-efficient reconfigurable inference processor for recurrent neural networks (RNN), named AERO

  • The functionality of AERO was verified successfully by programming it to perform inference tasks based on the various RNN models listed in Tables 5 and 6

  • Table notes: The ALUT counts can be obtained from the ALM [24] counts, considering the number of ALUTs in each ALM of the target devices. The BRAM result corresponds to the instances implementing the activation memory (AM), weight memory (WM), bias memory (BM), and instruction memory (IM), which are associated directly with AERO. The number inside the parentheses corresponds to AERO itself, while the number outside the parentheses corresponds to the entire system


Summary

Introduction

Recurrent neural networks (RNN) are a class of artificial neural networks whose dataflows have feedback connections. Such recurrent dataflows enable inference to be performed in a stateful manner based on the current and past inputs, thereby recognizing temporal characteristics [1]. An efficient architecture for GRU inference was previously presented based on a modified model exploiting temporal sparsity [17]. AERO is an instruction-set processor that can be programmed to perform RNN inference on models of various types; its instruction-set architecture (ISA) is formulated to efficiently perform the common primitive vector operations composing the dataflows of such models.
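To make concrete what "primitive vector operations composing the dataflows" means, the sketch below expresses one GRU time step entirely in terms of such primitives (matrix-vector product, elementwise add/multiply, activation). The function names and decomposition are illustrative assumptions, not AERO's actual ISA:

```python
import math

# Hypothetical decomposition of a GRU step into primitive vector operations,
# the kind of primitives an RNN-oriented ISA could dispatch. Biases are
# omitted for brevity.

def matvec(W, x):
    """Matrix-vector product primitive."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def vadd(a, b):
    """Elementwise vector addition primitive."""
    return [x + y for x, y in zip(a, b)]

def vmul(a, b):
    """Elementwise (Hadamard) vector multiplication primitive."""
    return [x * y for x, y in zip(a, b)]

def vact(v, f):
    """Elementwise activation primitive."""
    return [f(x) for x in v]

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def gru_step(x, h, Wz, Uz, Wr, Ur, Wh, Uh):
    """One GRU time step built only from the primitives above."""
    z = vact(vadd(matvec(Wz, x), matvec(Uz, h)), sigmoid)            # update gate
    r = vact(vadd(matvec(Wr, x), matvec(Ur, h)), sigmoid)            # reset gate
    h_cand = vact(vadd(matvec(Wh, x), matvec(Uh, vmul(r, h))), math.tanh)
    one_minus_z = [1.0 - zi for zi in z]
    return vadd(vmul(one_minus_z, h), vmul(z, h_cand))               # new hidden state
```

Because LSTM, GRU, and vanilla RNN cells all reduce to this same small set of primitives, a processor whose instructions map onto them can be reprogrammed across model types without hardware changes.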

Dataflow of RNN Inference
RNN-Specific Instruction-Set Architecture
Processing Pipeline
Vector Processing Unit Based on the Approximate Multipliers
Activation Coefficient Unit Based on the Reduced Tables
Prototype Inference System
Results and Evaluation
Conclusions