DASNet: Dynamic Activation Sparsity for Neural Network Efficiency Improvement

Qing Yang,Jiachen Mao,Zuoguan Wang,Hai Li

doi:10.1109/ictai.2019.00197

Abstract

To improve the execution speed and efficiency of neural networks in embedded systems, it is crucial to decrease the model size and computational complexity. In addition to conventional compression techniques, e.g., weight pruning and quantization, removing unimportant activations can reduce the amount of data communication and the computation cost. Unlike weight parameters, the pattern of activations is directly related to input data and thereby changes dynamically. To regulate the dynamic activation sparsity (DAS), in this work, we propose a generic low-cost approach based on winners-take-all (WTA) dropout technique. The network enhanced by the proposed WTA dropout, namely DASNet, features structured activation sparsity with an improved sparsity level. Compared to the static feature map pruning methods, DASNets provide better computation cost reduction. The WTA technique can be easily applied in deep neural networks without incurring additional training variables. Our experiments on various networks and datasets present significant run-time speedups with negligible accuracy loss.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DASNet: Dynamic Activation Sparsity for Neural Network Efficiency Improvement

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Dynamic Regularization on Activation Sparsity for Neural Network Efficiency Improvement
Qing Yang ... Jiachen Mao
ACM Journal on Emerging Technologies in Computing Systems | VOL. 17
Qing Yang, et. al.Qing Yang ... Jiachen Mao
30 Jun 2021
ACM Journal on Emerging Technologies in Computing Systems | VOL. 17

Towards efficient deep neural network execution with model compression and platform-specific optimization
Xiaolong Ma
-
Xiaolong MaXiaolong Ma
10 Feb 2023
10 Feb 2023

GenSyth: a new way to understand deep learning
Alexander Wong ... Mohammad Javad Shafiee
Electronics Letters | VOL. 55
Alexander Wong, et. al.Alexander Wong ... Mohammad Javad Shafiee
01 Sep 2019
Electronics Letters | VOL. 55

A Novel Pooling Method for Regularization of Deep Neural networks
El Houssaine Hssayni ... Mohamed Ettaouil
-
El Houssaine Hssayni, et. al.El Houssaine Hssayni ... Mohamed Ettaouil
01 Jun 2020
01 Jun 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DASNet: Dynamic Activation Sparsity for Neural Network Efficiency Improvement

Abstract

Talk to us

Similar Papers