FPGA Based Reconfigurable Coprocessor for Deep Convolutional Neural Network Training

Sajna Remi Clere,Sachin Sethumadhavan,Kuruvilla Varghese

doi:10.1109/dsd.2018.00072

Abstract

Deep Convolutional Neural Network (DCNN) is a class of machine learning algorithms that has wide application in pattern recognition, image recognition and video analysis. Convolutional layers in the network extract various features from a set of inputs and adapt parameters, before they do the classification. Training of DCNN is computationally intensive and has large memory requirement, but offers multiple degrees of parallelism, as similar structures are needed for computation at various intermediate stages. Training using a general purpose processing unit does not utilize parallelism of the network, and hence, is very time and energy inefficient. In this paper, we propose a coprocessor for accelerating the training of Convolutional Neural Network using a Xilinx Kintex Ultrascale XCKU085 based HTG-K800 FPGA board. DCNN is trained using back propagation algorithm. The coprocessor can be configured for a new network structure by changing the contents of Block Memory in the FPGA, without re-synthesizing and implementing using the design software. The reconfigurability through DDR can be supported with the architecture but is not implemented. The implementation achieves a maximum throughput of 280GOp/s.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

FPGA Based Reconfigurable Coprocessor for Deep Convolutional Neural Network Training

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

WE‐DE‐207B‐02: Detection of Masses On Mammograms Using Deep Convolutional Neural Network: A Feasibility Study
S Suzuki ... X Zhang
Medical Physics | VOL. 43
S Suzuki, et. al.S Suzuki ... X Zhang
01 Jun 2016
WE‐DE‐207B‐02: Detection of Masses On Mammograms Using Deep Convolutional Neural Network: A Feasibility Study
S Suzuki ... X Zhang

Aggregating Deep Convolutional Neural Network Scans of Broad-Area High-Resolution Remote Sensing Imagery
Grant J Scott ... Curt H Davis
-
Grant J Scott, et. al.Grant J Scott ... Curt H Davis
01 Jul 2018
01 Jul 2018

Competing ratio loss for discriminative multi-class image classification
Ke Zhang ... Tony X Han
Neurocomputing | VOL. 464
Ke Zhang, et. al.Ke Zhang ... Tony X Han
27 Aug 2021
Neurocomputing | VOL. 464

Applying Deep Learning Approach to the Far-Field Subwavelength Imaging Based on Near-Field Resonant Metalens at Microwave Frequencies
He Ming Yao ... Min Li
IEEE Access | VOL. 7
He Ming Yao, et. al.He Ming Yao ... Min Li
01 Jan 2019
IEEE Access | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

FPGA Based Reconfigurable Coprocessor for Deep Convolutional Neural Network Training

Abstract

Talk to us

Similar Papers