Efficient convolution architectures for convolutional neural network

Jichen Wang, Zhongfeng Wang, Jun Lin

doi:10.1109/wcsp.2016.7752726

Abstract

Convolutional neural networks (CNNs) are a state-of-the-art deep learning approach employed in various applications because of their remarkable performance. Convolutions generally dominate the overall computational complexity of a CNN and thus consume most of the computational power in real implementations. This paper discusses efficient hardware architectures that incorporate the parallel fast finite impulse response (FIR) algorithm (FFA) for CNN convolution. The theoretical derivations of the 3-parallel and 5-parallel FFAs are presented, and corresponding 3-parallel and 5-parallel fast convolution units (FCUs) are proposed for the most commonly used 3 × 3 and 5 × 5 convolutional kernels, respectively. Compared to conventional CNN convolution architectures, the proposed FCUs significantly reduce the number of multiplications used in convolutions. Additionally, the FCUs minimize the number of reads from the feature map memory. Furthermore, a reconfigurable FCU architecture that suits convolutions with both 3 × 3 and 5 × 5 kernels is proposed. Based on this, an efficient top-level architecture for processing a complete convolutional layer in a CNN is developed. To quantify the benefits of the proposed FCUs, an FCU design is coded in RTL and synthesized with TSMC 90 nm CMOS technology. The implementation results demonstrate that 30% and 36% of the computational energy can be saved compared to conventional solutions with 3 × 3 and 5 × 5 kernels, respectively.
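To illustrate how a parallel FFA trades multiplications for additions, here is a minimal Python sketch of the textbook 3-parallel FFA applied to a 3-tap filter (one row of a 3 × 3 kernel). It is an assumption-laden illustration, not the paper's FCU datapath: the function names `direct_fir3` and `ffa3_fir3` are hypothetical, the filter starts from a zero state, and the input length is assumed to be a multiple of 3. Each block of three outputs uses 6 multiplications, whereas a direct 3-tap FIR needs 9 for the same three outputs.

```python
def direct_fir3(x, h):
    """Reference 3-tap FIR: y[n] = h0*x[n] + h1*x[n-1] + h2*x[n-2] (zero initial state)."""
    y = []
    for n in range(len(x)):
        acc = 0
        for k in range(3):
            if n - k >= 0:
                acc += h[k] * x[n - k]
        y.append(acc)
    return y


def ffa3_fir3(x, h):
    """3-parallel FFA for a 3-tap filter.

    Processes three inputs per block and returns (outputs, multiplication_count).
    """
    assert len(x) % 3 == 0, "sketch assumes len(x) is a multiple of 3"
    h0, h1, h2 = h
    # Pre-added coefficients: computed once, no multiplications involved.
    h01, h12, h012 = h0 + h1, h1 + h2, h0 + h1 + h2

    # Products from the previous block (the one-block delay in the FFA).
    b_prev = c_prev = e_prev = 0
    y, mults = [], 0

    for k in range(0, len(x), 3):
        x0, x1, x2 = x[k], x[k + 1], x[k + 2]

        # Six multiplications per block of three outputs.
        a = h0 * x0
        b = h1 * x1
        c = h2 * x2
        d = h01 * (x0 + x1)
        e = h12 * (x1 + x2)
        f = h012 * (x0 + x1 + x2)
        mults += 6

        # Recombine with additions and subtractions only.
        y0 = a + (e_prev - b_prev - c_prev)  # h0*x0 + delayed (h1*x2 + h2*x1)
        y1 = d - a - b + c_prev              # h0*x1 + h1*x0 + delayed h2*x2
        y2 = f - d - e + 2 * b               # h0*x2 + h1*x1 + h2*x0
        y.extend([y0, y1, y2])

        b_prev, c_prev, e_prev = b, c, e

    return y, mults


if __name__ == "__main__":
    x = [1, 2, 3, 4, 5, 6]
    h = [1, 2, 3]
    y_fast, mults = ffa3_fir3(x, h)
    print(y_fast == direct_fir3(x, h), mults)
```

The saving comes at the cost of extra pre- and post-additions, which are generally cheaper in hardware than multipliers.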

Similar Papers

An attribution graph-based interpretable method for CNNs
Xiangwei Zheng ... Zhen Cui
Neural Networks | VOL. 179
05 Aug 2024

A Dual Neural Architecture Combined SqueezeNet with OctConv for LiDAR Data Classification.
Aili Wang ... Minhui Wang
Sensors | VOL. 19
12 Nov 2019

Hardware Implementation of Reconfigurable Separable Convolution
Lei Rao ... Bin Zhang
01 Jul 2018

Learnable Gabor kernels in convolutional neural networks for seismic facies classification
F Wang ... T Alkhalifah
01 Jan 2023
