Abstract

The strong performance of deep learning is well established. As research has advanced, however, neural networks have grown more complex and harder to deploy on resource-constrained devices. A series of model compression algorithms has made artificial intelligence on the edge possible. Among them, structured model pruning is widely used because of its versatility: it removes relatively unimportant structures from the network itself to reduce model size. However, previous pruning work still suffers from inaccurate evaluation of candidate sub-networks, pruning rates chosen empirically, and inefficient retraining. We therefore propose Combine-Net, an accurate, objective, and efficient pruning algorithm that introduces Adaptive BN to eliminate evaluation errors, the Kneedle algorithm to determine the pruning rate objectively, and knowledge distillation to improve retraining efficiency. Results show that, without loss of precision, Combine-Net achieves 95% parameter compression and 83% computation compression for VGG16 on CIFAR10, and 71% parameter compression and 41% computation compression for ResNet50 on CIFAR100. Experiments on different datasets and models demonstrate that Combine-Net efficiently compresses both the parameters and the computation of neural networks.
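To make the pruning-rate selection concrete, below is a minimal sketch of locating a knee point on an accuracy-versus-pruning-rate curve, assuming such a curve has already been measured. It uses the open-source `kneed` implementation of the Kneedle algorithm; all curve values are illustrative placeholders, not the paper's measurements.

```python
# Minimal sketch: choosing a pruning rate with the Kneedle algorithm.
# The accuracy values below are illustrative placeholders, not results
# from the Combine-Net paper.
from kneed import KneeLocator

pruning_rates = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9]
# Hypothetical evaluated accuracy of each pruned sub-network.
accuracies = [0.93, 0.93, 0.92, 0.92, 0.91, 0.90, 0.87, 0.80, 0.62]

# Accuracy stays nearly flat and then drops sharply as the pruning rate
# grows; Kneedle locates that transition (the "knee") objectively.
knee = KneeLocator(pruning_rates, accuracies,
                   curve="concave", direction="decreasing").knee
print(f"Selected pruning rate: {knee}")
```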

Highlights

  • As Internet of Things (IoT) technology grows in popularity, diverse sensors are emerging that carry massive amounts of raw data

  • Deep learning models cannot readily be deployed on resource-constrained devices or run smoothly in applications with stringent Quality of Experience (QoE) requirements

  • Experiments with VGG16 on CIFAR10 showed that, after pruning at a 95% rate, the accuracy of the sub-network corrected by the Adaptive Batch Normalization (BN) operation was about 40% higher than that of the uncorrected sub-network, better reflecting the sub-network's true performance (see the sketch after this list)
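The following is a minimal sketch of the Adaptive BN idea as it is commonly realized in PyTorch: a pruned sub-network's BN running statistics are reset and re-estimated on a few training batches before the sub-network is evaluated, so the statistics match the pruned architecture. Names such as `subnet` and `train_loader` are illustrative assumptions.

```python
import torch
import torch.nn as nn

def adaptive_bn(subnet: nn.Module, train_loader, num_batches: int = 50):
    """Re-estimate the BN running statistics of a pruned sub-network.

    Sketch: resets every BatchNorm layer, then runs a few forward
    passes in train mode (no gradient updates) so the running
    mean/variance reflect the pruned architecture before evaluation.
    """
    for m in subnet.modules():
        if isinstance(m, (nn.BatchNorm1d, nn.BatchNorm2d)):
            m.reset_running_stats()
            m.momentum = None  # use a cumulative moving average
    subnet.train()  # BN updates its running stats only in train mode
    with torch.no_grad():  # weights are untouched; only BN stats change
        for i, (x, _) in enumerate(train_loader):
            if i >= num_batches:
                break
            subnet(x)
    subnet.eval()
    return subnet
```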



Introduction

As Internet of Things (IoT) technology grows in popularity, diverse sensors are emerging that carry massive amounts of raw data, and efficiently extracting useful knowledge from so much raw data has become a challenge. To achieve better results, deep learning models usually have to go wider and deeper, which incurs high computational costs in terms of storage, memory, latency, and energy. As a result, these models cannot readily be deployed on resource-constrained devices or run smoothly in applications with stringent Quality of Experience (QoE) requirements. Compressing a computationally intensive model is a potential way to bring ubiquitous deep learning to resource-constrained devices and to applications under harsh QoE conditions. Among model compression methods, pruning requires much less expertise, can be applied to pre-trained models, and keeps accuracy loss bounded through retraining. These merits make pruning an attractive choice for model compression.
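As an illustration of the structured pruning described above, here is a minimal sketch that ranks the output filters of a convolutional layer by the L1 norm of their weights and keeps only the most important ones. The L1-norm criterion is a commonly used stand-in adopted here as an assumption; this excerpt does not specify the paper's own importance measure.

```python
import torch
import torch.nn as nn

def prune_conv_filters(conv: nn.Conv2d, keep_ratio: float) -> nn.Conv2d:
    """Structured pruning sketch: drop whole output filters of a conv layer.

    Filters are ranked by the L1 norm of their weights (an assumed,
    commonly used importance criterion) and the lowest-ranked ones are
    discarded, shrinking the layer itself rather than zeroing weights.
    """
    n_keep = max(1, int(conv.out_channels * keep_ratio))
    # Importance of each output filter: sum of absolute weight values.
    importance = conv.weight.detach().abs().sum(dim=(1, 2, 3))
    keep_idx = torch.argsort(importance, descending=True)[:n_keep]

    pruned = nn.Conv2d(conv.in_channels, n_keep,
                       kernel_size=conv.kernel_size, stride=conv.stride,
                       padding=conv.padding, bias=conv.bias is not None)
    pruned.weight.data = conv.weight.data[keep_idx].clone()
    if conv.bias is not None:
        pruned.bias.data = conv.bias.data[keep_idx].clone()
    return pruned

# Usage: replace a layer with its pruned version (the next layer's
# input channels must be reduced accordingly; omitted for brevity).
layer = nn.Conv2d(64, 128, kernel_size=3, padding=1)
print(prune_conv_filters(layer, keep_ratio=0.5))  # 64 filters remain
```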

