On energy complexity of fully-connected layers

Jiří Šíma,Jérémie Cabessa,Petra Vidnerová

doi:10.1016/j.neunet.2024.106419

Abstract

The massive increase in the size of deep neural networks (DNNs) is accompanied by a significant increase in energy consumption of their hardware implementations which is critical for their widespread deployment in low-power mobile devices. In our previous work, an abstract hardware-independent model of energy complexity for convolutional neural networks (CNNs) has been proposed and experimentally validated. Based on this model, we provide a theoretical analysis of energy complexity related to the computation of a fully-connected layer when its inputs, outputs, and weights are transferred between two kinds of memories (DRAM and Buffer). First, we establish a general lower bound on this energy complexity. Then, we present two dataflows and calculate their energy costs to achieve the corresponding upper bounds. In the case of a partitioned Buffer, we prove by the weak duality theorem from linear programming that the lower and upper bounds coincide up to an additive constant, and therefore establish the optimal energy complexity. Finally, the asymptotically optimal quadratic energy complexity of fully-connected layers is experimentally validated by estimating their energy consumption on the Simba and Eyeriss hardware.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

On energy complexity of fully-connected layers

Abstract

Talk to us

Similar Papers

More From: Neural Networks

Lead the way for us

Similar Papers

Energy Complexity of Convolutional Neural Networks.
Jiří Šíma ... Vojtěch Mrázek
Neural computation | VOL. 36
Jiří Šíma, et. al.Jiří Šíma ... Vojtěch Mrázek
20 May 2024
Neural computation | VOL. 36

DeepIoT
Shuochao Yao ... Yiran Zhao
-
Shuochao Yao, et. al.Shuochao Yao ... Yiran Zhao
06 Nov 2017
06 Nov 2017

Kernel-wise difference minimization for convolutional neural network compression in metaverse.
Yi-Ting Chang
Frontiers in big data | VOL. 6
Yi-Ting ChangYi-Ting Chang
04 Aug 2023
Frontiers in big data | VOL. 6

Deep distributed convolutional neural networks: Universality
Ding-Xuan Zhou
Analysis and Applications | VOL. 16
Ding-Xuan ZhouDing-Xuan Zhou
01 Nov 2018
Analysis and Applications | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On energy complexity of fully-connected layers

Abstract

Talk to us

Similar Papers

More From: Neural Networks