Training Compact DNNs with [formula omitted] Regularization

Anda Tang,Lingfeng Niu,Jianyu Miao,Peng Zhang

doi:10.1016/j.patcog.2022.109206

Abstract

Deep neural network(DNN) has achieved unprecedented success in many fields. However, its large model parameters which bring a great burden on storage and calculation hinder the development and application of DNNs. It is worthy of compressing the model to reduce the complexity of the DNN. Sparsity-inducing regularizer is one of the most common tools for compression. In this paper, we propose utilizing the ℓ1/2 quasi-norm to zero out weights of neural networks and compressing the networks automatically during the learning process. To our knowledge, it is the first work applying the non-Lipschitz continuous regularizer for the compression of DNNs. The resulting sparse optimization problem is solved by stochastic proximal gradient algorithm. For further convenience of calculation, an approximation of the threshold-form solution to the proximal operator with ℓ1/2 is given at the same time. Extensive experiments with various datasets and baselines demonstrate the advantages of our new method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Training Compact DNNs with [formula omitted] Regularization

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Journal: Pattern Recognition	Publication Date: Nov 26, 2022
Citations: 4

Similar Papers

Sparse Optimization Based on Non-convex $$\ell _{1/2}$$ Regularization for Deep Neural Networks
Anda Tang ... Lingfeng Niu
-
Anda Tang, et. al.Anda Tang ... Lingfeng Niu
01 Jan 2020
01 Jan 2020

Human activity recognition via wearable devices using enhanced ternary weight convolutional neural network
Mina Jaberi ... Reza Ravanmehr
Pervasive and Mobile Computing | VOL. 83
Mina Jaberi, et. al.Mina Jaberi ... Reza Ravanmehr
27 May 2022
Pervasive and Mobile Computing | VOL. 83

Deep stable neural networks: Large-width asymptotics and convergence rates
Stefano Favaro ... Sandra Fortini
Bernoulli | VOL. 29
Stefano Favaro, et. al.Stefano Favaro ... Sandra Fortini
01 Aug 2023
Bernoulli | VOL. 29

The 3-dimensional medical image recognition of right and left kidneys by deep GMDH-type neural network
Tadashi Kondo ... Junji Ueno
-
Tadashi Kondo, et. al.Tadashi Kondo ... Junji Ueno
01 Nov 2015
01 Nov 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Training Compact DNNs with [formula omitted] Regularization

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition