FPGA‐accelerated deep convolutional neural networks for high throughput and energy efficiency

Yuran Qiao, Chunyuan Zhang, Tao Xiao, Junzhong Shen, Qianming Yang, Mei Wen

doi:10.1002/cpe.3850

Abstract

Recent breakthroughs in deep convolutional neural networks (CNNs) have led to great improvements in the accuracy of both vision and auditory systems. Characterized by their deep structures and large numbers of parameters, deep CNNs challenge today's computational performance. Hardware specialization in the form of field‐programmable gate arrays (FPGAs) offers a promising path towards major leaps in computational performance while achieving high energy efficiency.

In this paper, we focus on accelerating deep CNNs using the Xilinx Zynq‐zq7045 FPGA SoC. As most of the computational workload can be converted to matrix multiplications, we adopt a matrix multiplier‐based accelerator architecture. Dedicated units are designed to eliminate the conversion overhead. We also design a customized memory system according to the memory access pattern of CNNs. To make the accelerator easily usable by application developers, our accelerator supports Caffe, a widely used software framework for deep CNNs. Different CNN models can be adopted by our accelerator, with good performance portability. The experimental results show that for a typical application of CNNs, image classification, an average throughput of 77.8 GFLOPS is achieved, while the energy efficiency is 4.7× better than that of an Nvidia K20 GPGPU.

© 2016 The Authors. Concurrency and Computation: Practice and Experience Published by John Wiley & Sons Ltd
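The statement that most of the CNN workload can be converted to matrix multiplications can be illustrated with a minimal software sketch. It assumes the common "im2col" formulation (each kernel-sized patch is unrolled into one row of a matrix), a single channel, stride 1, and no padding. The function names are hypothetical, and this is not the authors' hardware design: the patch duplication performed by `im2col` here is the kind of conversion overhead that the abstract says the dedicated units are designed to eliminate.

```python
def conv2d_direct(image, kernel):
    """Stride-1, no-padding 2D convolution (cross-correlation form used in CNN layers)."""
    k = len(kernel)
    out_h, out_w = len(image) - k + 1, len(image[0]) - k + 1
    out = [[0] * out_w for _ in range(out_h)]
    for i in range(out_h):
        for j in range(out_w):
            out[i][j] = sum(image[i + u][j + v] * kernel[u][v]
                            for u in range(k) for v in range(k))
    return out


def im2col(image, k):
    """Unroll every k x k patch into one row: result is (out_h*out_w) x (k*k)."""
    out_h, out_w = len(image) - k + 1, len(image[0]) - k + 1
    return [[image[i + u][j + v] for u in range(k) for v in range(k)]
            for i in range(out_h) for j in range(out_w)]


def matmul(a, b):
    """Plain dense matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def conv2d_as_matmul(image, kernel):
    """Same convolution, expressed as one matrix multiplication."""
    k = len(kernel)
    out_w = len(image[0]) - k + 1
    patches = im2col(image, k)                                    # (out_h*out_w) x (k*k)
    weights = [[kernel[u][v]] for u in range(k) for v in range(k)]  # (k*k) x 1
    flat = [row[0] for row in matmul(patches, weights)]           # one value per output pixel
    return [flat[r:r + out_w] for r in range(0, len(flat), out_w)]  # back to 2D


# Example: a 4 x 4 image and a 2 x 2 kernel give the same 3 x 3 output either way.
image = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
kernel = [[1, 0], [0, -1]]
assert conv2d_as_matmul(image, kernel) == conv2d_direct(image, kernel)
```

With several kernels, each one becomes an extra column of `weights`, so a whole layer maps onto a single larger matrix product, which is why a matrix multiplier is a natural core for such an accelerator.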


Journal: Concurrency and Computation: Practice and Experience	Publication Date: May 6, 2016
Citations: 52	License type: CC BY 4.0

