Abstract

With the increasing use of multi-purpose artificial intelligence of things (AIoT) devices, embedded field-programmable gate arrays (FPGAs) are excellent platforms for deep neural network (DNN) acceleration on edge devices. FPGAs offer low latency and high energy efficiency, but the scarcity of FPGA development resources challenges the deployment of DNN-based edge devices. Building a high-performance FPGA accelerator for DNNs requires register-transfer-level programming, hardware verification, and precise resource allocation; these tasks are challenging and time-consuming even for experienced hardware developers. We therefore propose an automated, collaborative design process built around an automatic design space exploration tool; an automatic DNN engine enables the tool to reshape and parse a DNN model from software to hardware. We also introduce a long short-term memory (LSTM)-based model that predicts performance and automatically generates a DNN model suited to developer requirements. We demonstrate our design scheme on three FPGAs: a ZCU104, a ZCU102, and a Cyclone V SoC (system on chip). The results show that our hardware-based edge accelerator achieves superior throughput compared with the most advanced edge graphics processing unit (GPU).
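The LSTM-based performance predictor is described only at a high level here. As a concrete illustration, the following is a minimal, hypothetical PyTorch sketch of such a model, assuming each candidate DNN is encoded as a sequence of fixed-length per-layer feature vectors (layer type, channel counts, kernel size, and so on) and that the model regresses performance targets such as latency and resource utilization. The name `PerfPredictor`, the feature dimensions, and the targets are illustrative assumptions, not taken from the paper.

```python
# Hypothetical sketch of an LSTM-based performance predictor, as a
# plausible reading of the approach described in the abstract.
# Assumptions (not from the paper): each DNN layer is encoded as a
# fixed-length feature vector, and the model regresses two targets,
# e.g., latency and resource utilization.
import torch
import torch.nn as nn

class PerfPredictor(nn.Module):
    def __init__(self, feat_dim=8, hidden_dim=64, num_targets=2):
        super().__init__()
        # One LSTM step per DNN layer descriptor (type, channels, kernel, ...)
        self.lstm = nn.LSTM(feat_dim, hidden_dim, batch_first=True)
        # Regression head, e.g., [latency_ms, lut_utilization]
        self.head = nn.Linear(hidden_dim, num_targets)

    def forward(self, layer_feats):
        # layer_feats: (batch, num_layers, feat_dim)
        _, (h_n, _) = self.lstm(layer_feats)
        # Use the final hidden state as a summary of the whole network
        return self.head(h_n[-1])  # (batch, num_targets)

# Toy usage: score a batch of 4 candidate models, each described by
# 10 layers with 8 features per layer.
model = PerfPredictor()
candidates = torch.randn(4, 10, 8)
print(model(candidates).shape)  # torch.Size([4, 2])
```

In a design space exploration loop, a predictor of this kind can score many candidate configurations in milliseconds, avoiding a full synthesis and place-and-route run for each candidate.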

Highlights

  • Recent studies have shown that field-programmable gate arrays (FPGAs) are promising candidates for deep neural network (DNN) implementation [1,2,3,4,5]

  • A DNN can be implemented directly in hardware, rather than on an existing central processing unit (CPU) or graphics processing unit (GPU), reducing latency and energy consumption

  • This study proposes an automated tool that maps the DNN design process from a deep learning framework to an FPGA


Introduction

Recent studies have shown that field-programmable gate arrays (FPGAs) are promising candidates for deep neural network (DNN) implementation [1,2,3,4,5]. A DNN can be implemented directly in hardware, rather than on an existing central processing unit (CPU) or graphics processing unit (GPU), reducing latency and energy consumption. These characteristics make FPGAs well suited to DNN-based applications in cloud and edge computing; as a result, FPGAs have been rapidly adopted for DNN acceleration. Internet of things (IoT) applications have stringent requirements in fields such as autonomous driving, safety, and monitoring; complex DNN models must produce quality results with minimal delay and power consumption while not exceeding resource constraints [6]. The scarcity of development resources makes the design and deployment of FPGA DNN accelerators challenging.
