Abstract

A convolutional neural network (CNN) is a type of artificial neural network that can learn features from large amounts of data and performs very well in large-scale image processing; it loosely simulates the behavior of the biological visual system. In recent years, with the development of deep neural network algorithms and hardware technology, conventional "CPU+GPU" servers cannot meet the computational demands of neural networks in various fields, so a large number of deep CNN accelerators based on the FPGA platform have gradually emerged. FPGAs are beginning to be used in image recognition and natural language processing because of their programmability, high performance, high stability, high security, and low power consumption. Although FPGAs have proven to deliver better performance, there is still room for optimization at the design level. Yolov3, as a classical algorithm, still consumes considerable time and computational resources in practice. To address this problem, this experiment optimizes the Yolov3 algorithm by introducing the CBAM attention mechanism into the Yolov3 model and by pruning the model at different ratios for the embedded system using the Network Slimming method. The optimized models are then verified on a TX2 embedded device developed by Nvidia using the COCO dataset. The experiment measures the precision, mAP, and number of parameters of the optimized Yolov3 algorithm under different optimization strategies. The results show that further optimization strategies for the Yolov3 algorithm can reduce computation time and memory usage more effectively without any degradation in accuracy.
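The Network Slimming step mentioned above selects channels to prune by comparing each channel's BatchNorm scaling factor (γ) against a global magnitude threshold derived from the chosen pruning ratio. The sketch below illustrates only that selection step, not the full retraining pipeline; the function name and the use of plain numpy arrays (one array of γ values per layer) are assumptions for illustration.

```python
import numpy as np

def slimming_prune_mask(gammas, prune_ratio):
    """Return a per-layer boolean keep-mask for channels, following the
    Network Slimming idea: rank all BatchNorm scaling factors globally
    by magnitude and drop the smallest `prune_ratio` fraction.

    gammas: list of 1-D arrays, one per layer, holding that layer's
            per-channel BN scaling factors.
    """
    # Pool the absolute scaling factors across every layer.
    all_g = np.concatenate([np.abs(g) for g in gammas])
    # The k smallest factors are pruned; the (k)-th value is the threshold.
    k = int(len(all_g) * prune_ratio)
    threshold = np.sort(all_g)[k] if k < len(all_g) else np.inf
    # Keep a channel when its |gamma| reaches the global threshold.
    return [np.abs(g) >= threshold for g in gammas]
```

For example, with two layers and a 40% pruning ratio, the two smallest factors across both layers are dropped regardless of which layer they belong to; this global ranking is what lets different pruning proportions be applied uniformly across the network.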
