Abstract

Convolutional neural network (CNN) training often requires considerable computational resources. In recent years, several studies have proposed CNN inference and training accelerators in which FPGAs have demonstrated good performance and energy efficiency. CNN training demands substantial memory bandwidth, FPGA platform resources, time, power, and large training datasets, and existing designs are constrained by the need for improved hardware acceleration to scale beyond current data and model sizes. This paper proposes a procedure for energy-efficient CNN training on an FPGA-based accelerator. We employ quantization, a common model compression technique, to speed up the CNN training process. Additionally, a gradient accumulation buffer is used to maximize operating efficiency while preserving the gradient-descent behavior of the learning algorithm. To validate the design, we implemented the AlexNet and VGG-16 models on an FPGA board and on a laptop CPU alongside a GPU. The accelerator achieves 203.75 GOPS with the AlexNet model and 196.50 GOPS with the VGG-16 model on a Terasic DE1-SoC. Our results also show that the FPGA accelerator is more energy efficient than the other platforms.
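The two optimizations named in the abstract, weight quantization and a gradient accumulation buffer, can be illustrated with a short sketch. The following NumPy code is a minimal illustration of the general techniques only, not the paper's FPGA implementation; the symmetric per-tensor INT8 scheme, toy loss, tensor shapes, and accumulation interval are all assumptions made for the example.

```python
import numpy as np

# Sketch: forward pass with INT8-quantized weights, full-precision
# gradients accumulated in a buffer, one SGD update per accumulated batch.
# Illustrative only; not the paper's hardware design.

def quantize_int8(w):
    """Symmetric per-tensor INT8 quantization: returns int8 weights and a scale."""
    max_abs = np.max(np.abs(w))
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = np.clip(np.round(w / scale), -128, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.standard_normal((16, 8)).astype(np.float32)   # toy weight matrix
grad_buffer = np.zeros_like(w)                        # gradient accumulation buffer
accum_steps, lr = 4, 0.01                             # hypothetical hyperparameters

for step in range(8):
    x = rng.standard_normal((8,)).astype(np.float32)  # toy input vector
    q, scale = quantize_int8(w)
    y = dequantize(q, scale) @ x                      # forward pass with quantized weights
    # Toy loss L = 0.5 * ||y||^2, so dL/dW = outer(y, x); the gradient is
    # applied straight-through to the full-precision master weights.
    grad_buffer += np.outer(y, x)
    if (step + 1) % accum_steps == 0:
        w -= lr * grad_buffer / accum_steps           # apply averaged accumulated gradient
        grad_buffer.fill(0.0)                         # reset buffer for the next group
```

Keeping full-precision master weights while computing with quantized copies is what lets the accumulation buffer preserve ordinary gradient-descent behavior even though individual forward passes run at reduced precision.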
