1xN Pattern for Pruning Convolutional Neural Networks.

Mingbao Lin,Shen Li,Yonghong Tian,Bohong Chen,Yuxin Zhang,Mengdi Wang,Rongrong Ji,Fei Chao,Yuchao Li

doi:10.1109/tpami.2022.3195774

Abstract

Though network pruning receives popularity in reducing the complexity of convolutional neural networks (CNNs), it remains an open issue to concurrently maintain model accuracy as well as achieve significant speedups on general CPUs. In this paper, we propose a novel 1×N pruning pattern to break this limitation. In particular, consecutive N output kernels with the same input channel index are grouped into one block, which serves as a basic pruning granularity of our pruning pattern. Our 1×N pattern prunes these blocks considered unimportant. We also provide a workflow of filter rearrangement that first rearranges the weight matrix in the output channel dimension to derive more influential blocks for accuracy improvements and then applies similar rearrangement to the next-layer weights in the input channel dimension to ensure correct convolutional operations. Moreover, the output computation after our 1×N pruning can be realized via a parallelized block-wise vectorized operation, leading to significant speedups on general CPUs. The efficacy of our pruning pattern is proved with experiments on ILSVRC-2012. For example, given the pruning rate of 50% and N=4, our pattern obtains about 3.0% improvements over filter pruning in the top-1 accuracy of MobileNet-V2. Meanwhile, it obtains 56.04ms inference savings on Cortex-A7 CPU over weight pruning. Our project is made available at https://github.com/lmbxmu/1xN.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

1xN Pattern for Pruning Convolutional Neural Networks.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence

Lead the way for us

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence	Publication Date: Jan 1, 2022
Citations: 27

Similar Papers

Thinning of convolutional neural network with mixed pruning
Wenzhu Yang ... Liping Chen
IET Image Processing | VOL. 13
Wenzhu Yang, et. al.Wenzhu Yang ... Liping Chen
20 Mar 2019
IET Image Processing | VOL. 13

Cluster Pruning: An Efficient Filter Pruning Method for Edge AI Vision Applications
Chinthaka Gamanayake ... Lahiru Jayasinghe
IEEE Journal of Selected Topics in Signal Processing | VOL. 14
Chinthaka Gamanayake, et. al.Chinthaka Gamanayake ... Lahiru Jayasinghe
05 Mar 2020
IEEE Journal of Selected Topics in Signal Processing | VOL. 14

DASH: Design Automation for Synthesis and Hardware Generation for CNN
Arish Sateesan ... Smitha K G
-
Arish Sateesan, et. al.Arish Sateesan ... Smitha K G
01 Dec 2020
01 Dec 2020

Energy Complexity of Convolutional Neural Networks.
Jiří Šíma ... Vojtěch Mrázek
Neural computation | VOL. 36
Jiří Šíma, et. al.Jiří Šíma ... Vojtěch Mrázek
20 May 2024
Neural computation | VOL. 36

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

1xN Pattern for Pruning Convolutional Neural Networks.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence