Discrimination-Aware Network Pruning for Deep Model Compression.

Jing Liu,Jinhui Zhu,Junzhou Huang,Bohan Zhuang,Mingkui Tan,Zhuangwei Zhuang,Yong Guo

doi:10.1109/tpami.2021.3066410

Abstract

We study network pruning which aims to remove redundant channels/kernels and hence speed up the inference of deep networks. Existing pruning methods either train from scratch with sparsity constraints or minimize the reconstruction error between the feature maps of the pre-trained models and the compressed ones. Both strategies suffer from some limitations: the former kind is computationally expensive and difficult to converge, while the latter kind optimizes the reconstruction error but ignores the discriminative power of channels. In this paper, we propose a simple-yet-effective method called discrimination-aware channel pruning (DCP) to choose the channels that actually contribute to the discriminative power. To this end, we first introduce additional discrimination-aware losses into the network to increase the discriminative power of the intermediate layers. Next, we select the most discriminative channels for each layer by considering the discrimination-aware loss and the reconstruction error, simultaneously. We then formulate channel pruning as a sparsity-inducing optimization problem with a convex objective and propose a greedy algorithm to solve the resultant problem. Note that a channel (3D tensor) often consists of a set of kernels (each with a 2D matrix). Besides the redundancy in channels, some kernels in a channel may also be redundant and fail to contribute to the discriminative power of the network, resulting in kernel level redundancy. To solve this issue, we propose a discrimination-aware kernel pruning (DKP) method to further compress deep networks by removing redundant kernels. To avoid manually determining the pruning rate for each layer, we propose two adaptive stopping conditions to automatically determine the number of selected channels/kernels. The proposed adaptive stopping conditions tend to yield more efficient models with better performance in practice. Extensive experiments on both image classification and face recognition demonstrate the effectiveness of our methods. For example, on ILSVRC-12, the resultant ResNet-50 model with 30 percent reduction of channels even outperforms the baseline model by 0.36 percent in terms of Top-1 accuracy. We also deploy the pruned models on a smartphone (equipped with a Qualcomm Snapdragon 845 processor). The pruned MobileNetV1 and MobileNetV2 achieve 1.93× and 1.42× inference acceleration on the mobile device, respectively, with negligible performance degradation. The source code and the pre-trained models are available at https://github.com/SCUT-AILab/DCP.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Discrimination-Aware Network Pruning for Deep Model Compression.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence

Lead the way for us

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence	Publication Date: Jan 1, 2021
Citations: 45

Similar Papers

Influence Function Based Second-Order Channel Pruning: Evaluating True Loss Changes For Pruning Is Possible Without Retraining.
Hongrong Cheng ... Miao Zhang
IEEE transactions on pattern analysis and machine intelligence | VOL. PP
Hongrong Cheng, et. al.Hongrong Cheng ... Miao Zhang
01 Jan 2024
IEEE transactions on pattern analysis and machine intelligence | VOL. PP

CHEX: CHannel EXploration for CNN Model Compression
Zejiang Hou ... Yuan Xie
-
Zejiang Hou, et. al.Zejiang Hou ... Yuan Xie
01 Jun 2022
01 Jun 2022

Dynamical Channel Pruning by Conditional Accuracy Change for Deep Neural Networks.
Zhiqiang Chen ... Ting-Bing Xu
IEEE Transactions on Neural Networks and Learning Systems | VOL. 32
Zhiqiang Chen, et. al.Zhiqiang Chen ... Ting-Bing Xu
01 Feb 2021
IEEE Transactions on Neural Networks and Learning Systems | VOL. 32

Lossless Reconstruction of Convolutional Neural Network for Channel-Based Network Pruning.
Donghyeon Lee ... Youngbae Hwang
Sensors (Basel, Switzerland) | VOL. 23
Donghyeon Lee, et. al.Donghyeon Lee ... Youngbae Hwang
13 Feb 2023
Sensors (Basel, Switzerland) | VOL. 23

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Discrimination-Aware Network Pruning for Deep Model Compression.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence