Abstract
The success of CNNs is accompanied by deep models and heavy storage costs. For compressing CNNs, we propose an efficient and robust pruning approach, cross-entropy pruning (CEP). Given a trained CNN model, connections are divided into groups according to their corresponding output neurons. All connections whose cross-entropy errors fall below a group-wise threshold are then removed, yielding a sparse model whose parameter count is significantly reduced relative to the baseline. This letter also presents a highest cross-entropy pruning (HCEP) method that keeps a small portion of the weights with the highest cross-entropy errors, further improving the accuracy of CEP. To validate CEP, we conducted experiments on low-redundancy networks that are hard to compress. On the MNIST data set, CEP achieves the 0.08% accuracy drop required by the LeNet-5 benchmark with only 16% of the original parameters. Our proposed CEP also reduces the storage cost of AlexNet on the ILSVRC 2012 data set by approximately 75%, increasing the top-1 error by only 0.4% and the top-5 error by only 0.2%. Compared with three existing methods on LeNet-5, our proposed CEP and HCEP perform significantly better in terms of accuracy and stability. Computer vision tasks built on CNNs, such as object detection and style transfer, can be computed efficiently using our CEP and HCEP strategies.
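The group-wise pruning step described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: it assumes each row of a weight matrix forms one group (one output neuron) and that a per-connection importance score is already available; in CEP that score would be the connection's cross-entropy error, whose exact computation is defined in the paper. The function `group_wise_prune`, the `scores` argument, and the `sparsity` parameter are all hypothetical names chosen for this sketch.

```python
import numpy as np

def group_wise_prune(weight_matrix, scores, sparsity=0.5):
    """Zero out low-importance connections group by group.

    Each row of `weight_matrix` is one group (the fan-in of one output
    neuron).  Within a group, connections whose score falls below the
    group's `sparsity` quantile are removed (set to zero), so every
    group keeps roughly the same fraction of its connections.
    """
    pruned = weight_matrix.copy()
    for row, s in zip(pruned, scores):
        threshold = np.quantile(s, sparsity)  # group-wise threshold
        row[s < threshold] = 0.0              # remove low-score links
    return pruned

# Toy usage: two output neurons, five inputs each, sparsity 0.5
W = np.ones((2, 5))
S = np.array([[1., 2., 3., 4., 5.],
              [5., 4., 3., 2., 1.]])
P = group_wise_prune(W, S, sparsity=0.5)
```

Because the threshold is computed per group rather than globally, no single output neuron loses all of its incoming connections, which is one motivation for group-wise (rather than network-wide) thresholding.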