Dynamic Regularization on Activation Sparsity for Neural Network Efficiency Improvement

Qing Yang,Zuoguan Wang,“Helen” Li Hai,Jiachen Mao

doi:10.1145/3447776

Abstract

When deploying deep neural networks in embedded systems, it is crucial to decrease the model size and computational complexity for improving the execution speed and efficiency. In addition to conventional compression techniques, e.g., weight pruning and quantization, removing unimportant activations can also dramatically reduce the amount of data communication and the computation cost. Unlike weight parameters, the pattern of activations is directly related to input data and thereby changes dynamically. To regulate the dynamic activation sparsity (DAS), in this work, we propose a generic low-cost approach based on winners-take-all (WTA) dropout technique. The network enhanced by the proposed WTA dropout, namely DASNet , features structured activation sparsity with an improved sparsity level. Compared to the static feature map pruning methods, DASNets provide better computation cost reduction. The WTA dropout technique can be easily applied in deep neural networks without incurring additional training variables. More importantly, DASNet can be seamlessly integrated with other compression techniques, such as weight pruning and quantization, without compromising accuracy. Our experiments on various networks and datasets present significant runtime speedups with negligible accuracy losses.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Dynamic Regularization on Activation Sparsity for Neural Network Efficiency Improvement

Abstract

Talk to us

Similar Papers

More From: ACM Journal on Emerging Technologies in Computing Systems

Lead the way for us

Journal: ACM Journal on Emerging Technologies in Computing Systems	Publication Date: Jun 30, 2021
Citations: 1

Similar Papers

DASNet: Dynamic Activation Sparsity for Neural Network Efficiency Improvement
Qing Yang ... Hai Li
-
Qing Yang, et. al.Qing Yang ... Hai Li
01 Nov 2019
01 Nov 2019

Towards efficient deep neural network execution with model compression and platform-specific optimization
Xiaolong Ma
-
Xiaolong MaXiaolong Ma
10 Feb 2023
10 Feb 2023

High-performance and energy-efficient deep learning for resource-constrained devices
Ao Ren
-
Ao RenAo Ren
10 May 2021
10 May 2021

ADMM-NN
Ao Ren ... Tianyun Zhang
-
Ao Ren, et. al.Ao Ren ... Tianyun Zhang
04 Apr 2019
04 Apr 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dynamic Regularization on Activation Sparsity for Neural Network Efficiency Improvement

Abstract

Talk to us

Similar Papers

More From: ACM Journal on Emerging Technologies in Computing Systems