SnaPEA: Predictive Early Activation for Reducing Computation in Deep Convolutional Neural Networks

Vahideh Akhlaghi,Rajesh K Gupta,Amir Yazdanbakhsh,Kambiz Samadi,Hadi Esmaeilzadeh

doi:10.1109/isca.2018.00061

Abstract

Deep Convolutional Neural Networks (CNNs) perform billions of operations for classifying a single input. To reduce these computations, this paper offers a solution that leverages a combination of runtime information and the algorithmic structure of CNNs. Specifically, in numerous modern CNNs, the outputs of compute-heavy convolution operations are fed to activation units that output zero if their input is negative. By exploiting this unique algorithmic property, we propose a predictive early activation technique, dubbed SnaPEA. This technique cuts the computation of convolution operations short if it determines that the output will be negative. SnaPEA can operate in two distinct modes, exact and predictive. In the exact mode, with no loss in classification accuracy, SnaPEA statically re-orders the weights based on their signs and periodically performs a single-bit sign check on the partial sum. Once the partial sum drops below zero, the rest of computations can simply be ignored, since the output value will be zero in any case. In the predictive mode, which trades the classification accuracy for larger savings, SnaPEA speculatively cuts the computation short even earlier than the exact mode. To control the accuracy, we develop a multi-variable optimization algorithm that thresholds the degree of speculation. As such, the proposed algorithm exposes a knob to gracefully navigate the trade-offs between the classification accuracy and computation reduction. Compared to a state-of-the-art CNN accelerator, SnaPEA in the exact mode, yields, on average, 28% speedup and 16% energy reduction in various modern CNNs without affecting their classification accuracy. With 3% loss in classification accuracy, on average, 67.8% of the convolutional layers can operate in the predictive mode. The average speedup and energy saving of these layers are 2.02x and 1.89x, respectively. The benefits grow to a maximum of 3.59x speedup and 3.14x energy reduction. Compared to static pruning approaches, which are complimentary to the dynamic approach of SnaPEA, our proposed technique offers up to 63% speedup and 49% energy reduction across the convolution layers with no loss in classification accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SnaPEA: Predictive Early Activation for Reducing Computation in Deep Convolutional Neural Networks

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Automatic Generation of Multi-Precision Multi-Arithmetic CNN Accelerators for FPGAs
Yiren Zhao ... Robert Mullins
-
Yiren Zhao, et. al.Yiren Zhao ... Robert Mullins
01 Dec 2019
01 Dec 2019

Escher: A CNN Accelerator with Flexible Buffering to Minimize Off-Chip Transfer
Yongming Shen ... Michael Ferdman
-
Yongming Shen, et. al.Yongming Shen ... Michael Ferdman
01 Apr 2017
01 Apr 2017

A Substitution of Convolutional Layers by FFT Layers - A Low Computational Cost Version
Umar Farooq Mohammad ... Mohamed Almekkawy
-
Umar Farooq Mohammad, et. al.Umar Farooq Mohammad ... Mohamed Almekkawy
11 Sep 2021
11 Sep 2021

Towards Efficient Forward Propagation on Resource-Constrained Systems
Günther Schindler ... Matthias Zöhrer
-
Günther Schindler, et. al.Günther Schindler ... Matthias Zöhrer
01 Jan 2019
01 Jan 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SnaPEA: Predictive Early Activation for Reducing Computation in Deep Convolutional Neural Networks

Abstract

Talk to us

Similar Papers