FPGA Implementation of a Deep Learning Acceleration Core Architecture for Image Target Detection

Xu Yang,Qiang Wang,Wenquan Feng,Chen Zhuang,Zhe Yang

doi:10.3390/app13074144

Abstract

Due to the flexibility and ease of deployment of Field Programmable Gate Arrays (FPGA), more and more studies have been conducted on developing and optimizing target detection algorithms based on Convolutional Neural Networks (CNN) models using FPGAs. Still, these studies focus on improving the performance of the core algorithm and optimizing hardware structure, with few studies focusing on the unified architecture design and corresponding optimization techniques for the algorithm model, resulting in inefficient overall model performance. The essential reason is that these studies do not address arithmetic power, speed, and resource consistency. In order to solve this problem, we propose a deep learning acceleration core architecture based on FPGAs, which is designed for target detection algorithms with CNN models, using multi-channel parallelization of CNN network models to improve the arithmetic power, using scheduling tasks and intensive computation pipelining to meet the algorithm’s data bandwidth requirements and unifying the speed and area of the orchestrated computation matrix to save hardware resources. The proposed framework achieves 14 Frames Per Second (FPS) inference performance of the TinyYolo model at 5 Giga Operations Per Second (GOPS) with 30% higher running clock frequency, 2–4 times higher arithmetic power, and 28% higher Digital Signal Processing (DSP) resource utilization efficiency using less than 25% of FPGA resource usage.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: Mar 24, 2023
Citations: 4	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

FPGA Implementation of a Deep Learning Acceleration Core Architecture for Image Target Detection

Abstract

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Design and Implementation of a Deep Learning Target Detection System
...
-
, et. al. ...
30 Sep 2020
30 Sep 2020

EPA: The effective pipeline architecture for CNN accelerator with high performance and computing efficiency based on FPGA
Junjie Zhang ... Bingyao Cao
Concurrency and Computation: Practice and Experience | VOL. 35
Junjie Zhang, et. al.Junjie Zhang ... Bingyao Cao
31 Mar 2021
Concurrency and Computation: Practice and Experience | VOL. 35

Spatial- and time- division multiplexing in CNN accelerator
Tetsuro Nakamura ... Akinori Shiraga
Parallel Computing | VOL. 111
Tetsuro Nakamura, et. al.Tetsuro Nakamura ... Akinori Shiraga
24 Mar 2022
Parallel Computing | VOL. 111

Artificial intelligence: finding the intersection of predictive modeling and clinical utility
Karthik Ravi
Gastrointestinal Endoscopy | VOL. 93
Karthik RaviKarthik Ravi
07 Mar 2021
Gastrointestinal Endoscopy | VOL. 93

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

FPGA Implementation of a Deep Learning Acceleration Core Architecture for Image Target Detection

Abstract

Talk to us

Similar Papers

More From: Applied Sciences