FLOPs-efficient filter pruning via transfer scale for neural network acceleration

Zhixin Guo,Yifan Xiao,Wenzhi Liao,Peter Veelaert,Wilfried Philips

doi:10.1016/j.jocs.2021.101459

Abstract

Model pruning is a useful technique to reduce the computational cost of convolutional neural networks. In this paper, we first propose a simple but effective filter level pruning criterion, which assesses the importance of a filter by exploring the transfer scale (TS) of its feature maps in the next layer. The principle is that for a trained CNN model, an important filter should have strong connections with the next layer, otherwise the transfer scale of its feature map will be low and hence removing it will have little influence on the network. Besides, we observe that filters from the computationally-intensive layers are more sensitive to pruning, which makes it difficult to further compress the floating-point operations (FLOPs) of the model without reducing accuracy. To solve this problem, we propose a FLOPs-efficient group Lasso approach for TS to guide the network to use fewer filters in the computationally-intensive layers, which leads to better FLOPs compression performance after pruning. We refer to the proposed method as FETS. Compared with the state-of-the-art methods, our FETS achieves similar or better accuracy, but with significantly larger FLOPs compression ratio. In particular, with VGG-16, ResNet-56 and DenseNet-40 on CIFAR-10, we achieve similar or better accuracies than other methods, with only 48%, 64% and 58% of the FLOPs. With ResNet-50 on ImageNet, we also achieve a relative FLOPs reduction of 30%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

FLOPs-efficient filter pruning via transfer scale for neural network acceleration

Abstract

Talk to us

Similar Papers

More From: Journal of Computational Science

Lead the way for us

Journal: Journal of Computational Science	Publication Date: Oct 1, 2021
Citations: 3

Similar Papers

Accelerating Deep Unsupervised Domain Adaptation with Transfer Channel Pruning
Chaohui Yu ... Jindong Wang
-
Chaohui Yu, et. al.Chaohui Yu ... Jindong Wang
01 Jul 2019
01 Jul 2019

Latency-aware automatic CNN channel pruning with GPU runtime analysis
Jiaqiang Liu ... Guangzhong Sun
BenchCouncil Transactions on Benchmarks, Standards and Evaluations | VOL. 1
Jiaqiang Liu, et. al.Jiaqiang Liu ... Guangzhong Sun
01 Oct 2021
BenchCouncil Transactions on Benchmarks, Standards and Evaluations | VOL. 1

Expressive power of ReLU and step networks under floating-point operations
Yeachan Park ... Sejun Park
Neural Networks | VOL. 175
Yeachan Park, et. al.Yeachan Park ... Sejun Park
09 Apr 2024
Neural Networks | VOL. 175

Convolutional Neural Networks for Visual Information Analysis with Limited Computing Resources
Paraskevi Nousi ... Emmanouil Patsiouras
-
Paraskevi Nousi, et. al.Paraskevi Nousi ... Emmanouil Patsiouras
01 Oct 2018
01 Oct 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

FLOPs-efficient filter pruning via transfer scale for neural network acceleration

Abstract

Talk to us

Similar Papers

More From: Journal of Computational Science