Raising the Abstraction Level of a Deep Learning Design on FPGAs

Dario Baptista,Fernando Morgado-Dias,Leonel Sousa

doi:10.1109/access.2020.3036975

Abstract

Autonomous and intelligent systems based on deep learning, continuously attract the attention of researchers and engineers. With the progress on the application of deep learning for modern applications arises the challenge of reaching real-time processing. To face this challenge, Field Programmable Gate Arrays (FPGAs) can be used; however, deep learning generic implementations on an FPGA are still a topic of research. Advances in FPGA technology allow for designs based on High-Level Synthesis (HLS) for accelerating and facilitating implementations of complex problems on hardware. A platform based on HLS for emulating a generic parameterizable deep learning system on an FPGA is proposed in this paper, allowing for the implementation of any structure based on the following layers: convolution, max-pooling, batch-normalization, and fully connected networks. Through this platform, it is possible to implement a deep learning system on FPGAs using an N-Fold or a Flow architecture without the assistance of central processing units. Whereas the N-Fold architecture requires fewer hardware resources, as it re-uses resources, the Flow architecture presents a higher throughput. The developed platform improves the deep learning design productivity by automating the generation of the system, achieving efficiency and raising the level of abstraction, as was experimentally verified and evaluated.

Highlights

A Large quantity of computation is required to analyze data based on deep learning [37]
COMPARATIVE EVALUATION WITH STATE-OF-THE-ART The Convolutional Neural Networks (CNNs) applied for the performance comparison between the proposed architectures and the state-of-the-art Field Programmable Gate Arrays (FPGAs) implementations has in the first layer 11 convolutions, with 3×3 kernels and 3 feature maps as input, producing maps of size 24×24
A platform to deploy a generic parameterizable-based deep learning on an FPGA was proposed in this paper

Summary

INTRODUCTION

A Large quantity of computation is required to analyze data based on deep learning [37]. A novel way of solving this problem, proposed in this paper, is to increase the abstraction to the system level by providing a platform to deploy a generic parameterizablebased deep learning on an FPGA. Such a platform provides designers with a choice in choosing or developing their own deep learning framework while providing efficient implementations. Two different architectures are provided to make the platform more efficient and useful, allowing the designer to adjust the achieved throughput and the required resources In this way, the designer has the freedom to perform design space exploration, which allows for finer customization. The main advantage of using such a platform is that the designer can implement deep learning on FPGAs without the necessity of an in-depth understanding of the underlying hardware

BACKGROUND

RELATED WORK

ARCHITECTURES TO DESIGN DEEP NEURAL NETWORKS

MAIN OPERATION MODULES

CONVOLUTION NEURAL NETWORKS

Findings

CONCLUSIONS

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE access : practical innovations, open solutions	Publication Date: Jan 1, 2020
Citations: 36	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Raising the Abstraction Level of a Deep Learning Design on FPGAs

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE access : practical innovations, open solutions

Lead the way for us

Similar Papers

FPGAs for Software Programmers
Dirk Koch ... Daniel Ziener
-
Dirk Koch, et. al.Dirk Koch ... Daniel Ziener
01 Jan 2015
01 Jan 2015

Application of FPGAs to High-Speed Condition Based Maintenance of Rolling Element Bearings
Mark Harriman ... Rob Parkin
-
Mark Harriman, et. al.Mark Harriman ... Rob Parkin
01 Jan 2004
01 Jan 2004

Implementation of a High Throughput Variable Decimation Pane Filter Using the Xilinx System Generator
Jerahmie Radder
-
Jerahmie RadderJerahmie Radder
01 Jan 2003
01 Jan 2003

Chaos secure communication’ implementation in FPGA
T Tami ... T Messaoudene
-
T Tami, et. al.T Tami ... T Messaoudene
01 Nov 2018
01 Nov 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Raising the Abstraction Level of a Deep Learning Design on FPGAs

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE access : practical innovations, open solutions