Design Space Exploration of Matrix–Matrix Convolution Operation

Piyalee Behera,Arighna Deb

doi:10.1142/s0218126621502881

Abstract

Convolution is an important operation in neural networks which, in recent years, received significant attention from the researchers thanks to its ability to handle complex tasks such as image processing, computer vision in an efficient manner. In general, the convolution operation in neural networks considers two matrices as inputs: an image matrix representing an image and a kernel matrix required for necessary image processing operation and performs several multiplications and addition operations among the elements of image and kernel matrices. Realizing a circuit structure for matrix–matrix convolution is straightforward as each multiplication is realized by a multiplier, whereas an addition is carried out by an adder. However, the corresponding circuits result in large area, high power consumption and long delay because of the large number of multiplications and additions that are involved in the matrix–matrix convolution operations. While, the existing approaches focus on the accelerations of this computationally intensive tasks, they often do not guarantee minimality of area, power and delay. But we show that there exists design aspects through which the circuit structures for convolution operations can be realized with less area, power and delay. To do this, we consider the kernel definitions during the design of the circuit structures since the kernel matrices are often (pre)-determined based on the desired applications. Motivated by this, we first explore the design space of the convolution operation by introducing an alternative design scheme for realizing the respective operation between two matrices keeping the image processing/neural network applications in mind. Experimental evaluations confirm the potential benefits of the proposed design scheme and demonstrate that the reductions in the area and power by approximately [Formula: see text] and critical path delay by approximately [Formula: see text] can be achieved using the proposed design scheme. In addition, the FPGA implementations of the proposed scheme also show that the reductions of approximately [Formula: see text] and [Formula: see text] in the number of LUTs and in the number of pins, respectively, can be achieved. Compared to prior works, the proposed scheme allows higher parallelism with minimum LUT utilization.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Design Space Exploration of Matrix–Matrix Convolution Operation

Abstract

Talk to us

Similar Papers

More From: Journal of Circuits, Systems and Computers

Lead the way for us

Journal: Journal of Circuits, Systems and Computers	Publication Date: Jun 18, 2021
Citations: 1

Similar Papers

Signal processing algorithm for neural networks with integrodifferential splines as an activation function and its particular case of image classification
T.K Biryukova
Highly available systems | VOL. -
T.K BiryukovaT.K Biryukova
01 Jan 2020
Highly available systems | VOL. -

Approximation by the Extended Neural Network Operators of Kantorovich Type
Chenghao Xiang ... Peixin Ye
Mathematics | VOL. 11
Chenghao Xiang, et. al.Chenghao Xiang ... Peixin Ye
17 Apr 2023
Mathematics | VOL. 11

Training and Operation of Multi-layer Convolutional Neural Network Using Electronic Synapses
Yi Ding ... Ding Luo
Journal of Physics: Conference Series | VOL. 1631
Yi Ding, et. al.Yi Ding ... Ding Luo
01 Sep 2020
Journal of Physics: Conference Series | VOL. 1631

Multivariate neural network operators with sigmoidal activation functions
Danilo Costarelli ... Renato Spigler
Neural Networks | VOL. 48
Danilo Costarelli, et. al.Danilo Costarelli ... Renato Spigler
06 Aug 2013
Neural Networks | VOL. 48

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Design Space Exploration of Matrix–Matrix Convolution Operation

Abstract

Talk to us

Similar Papers

More From: Journal of Circuits, Systems and Computers