Abstract

Deep neural network (DNN) training is an iterative process of updating network weights via gradient computation, for which the (mini-batch) stochastic gradient descent (SGD) algorithm is generally used. Since SGD inherently tolerates noise in the gradients, approximating the weight-gradient computation within this noise margin is a promising way to save energy and time during DNN training. This article proposes two novel techniques that reduce the computational complexity of gradient computation to accelerate SGD-based DNN training. First, because the output predictions of a network (confidence) change with the training inputs, the relation between the confidence and the magnitude of the weight gradient can be exploited to skip gradient computations, especially for high-confidence inputs, without seriously sacrificing accuracy. Second, angle diversity-based approximations of the intermediate activations used in weight-gradient calculation are presented. Based on the observation that the angle diversity of gradients is small (highly uncorrelated) in the early training epochs, the bit precision of activations can be reduced to 2, 4, or 8 bits depending on the resulting angle error between the original gradient and the quantized gradient. Simulations show that the proposed approach can skip up to 75.83% of gradient computations with negligible accuracy degradation on the CIFAR-10 dataset using ResNet-20. Hardware implementation results in 65-nm CMOS technology also show that the proposed training accelerator achieves up to 1.69x higher energy efficiency than other training accelerators.
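To make the two ideas concrete, the sketch below illustrates (1) confidence-based skipping of per-sample weight-gradient computation and (2) selecting an activation bit width (2/4/8 bit) by the angle error between the full-precision and quantized weight gradients of a linear layer. This is a minimal NumPy illustration, not the paper's implementation: the threshold values (`conf_threshold=0.95`, `max_angle_err=5.0`), the uniform symmetric quantizer, and the single-linear-layer gradient form are all assumptions made for this example.

```python
import numpy as np

def skip_mask(softmax_probs, labels, conf_threshold=0.95):
    """True = compute the weight gradient for this sample, False = skip it.
    High-confidence samples contribute small weight gradients, so they are
    skipped (threshold value is hypothetical)."""
    conf = softmax_probs[np.arange(len(labels)), labels]
    return conf < conf_threshold

def quantize(x, bits):
    """Uniform symmetric quantization of activations to the given bit width."""
    scale = np.max(np.abs(x)) / (2 ** (bits - 1) - 1) + 1e-12
    return np.round(x / scale) * scale

def angle_deg(a, b):
    """Angle in degrees between two flattened tensors."""
    cos = np.dot(a.ravel(), b.ravel()) / (
        np.linalg.norm(a) * np.linalg.norm(b) + 1e-12)
    return np.degrees(np.arccos(np.clip(cos, -1.0, 1.0)))

def select_bits(activation, upstream_grad, max_angle_err=5.0,
                candidates=(2, 4, 8)):
    """Pick the lowest bit width whose quantized weight gradient stays within
    the allowed angular error of the full-precision weight gradient.
    For a linear layer y = x @ W.T, dL/dW = upstream_grad.T @ activation."""
    ref_grad = upstream_grad.T @ activation
    for bits in candidates:
        q_grad = upstream_grad.T @ quantize(activation, bits)
        if angle_deg(ref_grad, q_grad) <= max_angle_err:
            return bits
    return 32  # fall back to full precision

# Toy usage with random data
rng = np.random.default_rng(0)
probs = rng.dirichlet(np.ones(10), size=8)      # softmax outputs, batch of 8
labels = rng.integers(0, 10, size=8)
print("compute-gradient mask:", skip_mask(probs, labels))

act = rng.standard_normal((8, 64)).astype(np.float32)   # layer input activations
gout = rng.standard_normal((8, 32)).astype(np.float32)  # upstream gradients
print("selected activation bits:", select_bits(act, gout))
```

In an actual training loop, the skip mask would gate the per-sample backward pass for the weight gradients, and the bit-width selection would be updated per layer as training progresses, consistent with the abstract's observation that lower precision is tolerable in early epochs.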
