Abstract

Autonomous and intelligent systems based on deep learning, continuously attract the attention of researchers and engineers. With the progress on the application of deep learning for modern applications arises the challenge of reaching real-time processing. To face this challenge, Field Programmable Gate Arrays (FPGAs) can be used; however, deep learning generic implementations on an FPGA are still a topic of research. Advances in FPGA technology allow for designs based on High-Level Synthesis (HLS) for accelerating and facilitating implementations of complex problems on hardware. A platform based on HLS for emulating a generic parameterizable deep learning system on an FPGA is proposed in this paper, allowing for the implementation of any structure based on the following layers: convolution, max-pooling, batch-normalization, and fully connected networks. Through this platform, it is possible to implement a deep learning system on FPGAs using an N-Fold or a Flow architecture without the assistance of central processing units. Whereas the N-Fold architecture requires fewer hardware resources, as it re-uses resources, the Flow architecture presents a higher throughput. The developed platform improves the deep learning design productivity by automating the generation of the system, achieving efficiency and raising the level of abstraction, as was experimentally verified and evaluated.

Highlights

  • A Large quantity of computation is required to analyze data based on deep learning [37]

  • COMPARATIVE EVALUATION WITH STATE-OF-THE-ART The Convolutional Neural Networks (CNNs) applied for the performance comparison between the proposed architectures and the state-of-the-art Field Programmable Gate Arrays (FPGAs) implementations has in the first layer 11 convolutions, with 3×3 kernels and 3 feature maps as input, producing maps of size 24×24

  • A platform to deploy a generic parameterizable-based deep learning on an FPGA was proposed in this paper

Read more

Summary

INTRODUCTION

A Large quantity of computation is required to analyze data based on deep learning [37]. A novel way of solving this problem, proposed in this paper, is to increase the abstraction to the system level by providing a platform to deploy a generic parameterizablebased deep learning on an FPGA. Such a platform provides designers with a choice in choosing or developing their own deep learning framework while providing efficient implementations. Two different architectures are provided to make the platform more efficient and useful, allowing the designer to adjust the achieved throughput and the required resources In this way, the designer has the freedom to perform design space exploration, which allows for finer customization. The main advantage of using such a platform is that the designer can implement deep learning on FPGAs without the necessity of an in-depth understanding of the underlying hardware

BACKGROUND
RELATED WORK
ARCHITECTURES TO DESIGN DEEP NEURAL NETWORKS
MAIN OPERATION MODULES
CONVOLUTION NEURAL NETWORKS
Findings
CONCLUSIONS
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call