PCA driven mixed filter pruning for efficient convNets

Waqas Ahmed, Shahab Ansari, Muhammad Hanif, Akhtar Khalil

doi:10.1371/journal.pone.0262386

Open Access

https://doi.org/10.1371/journal.pone.0262386

Abstract

Deploying deep neural networks (DNNs) on resource-constrained devices is challenging because such devices have limited memory and computational power. In most cases, pruning techniques do not prune DNNs to their full extent, and redundancy remains in the pruned models. To address this, a mixed filter pruning approach based on principal component analysis (PCA) and the geometric median is presented. First, a pre-trained model is analyzed using PCA to identify the important filters in every layer. These important filters are then used to reconstruct the network with fewer layers and fewer filters per layer, and the new network with this optimized structure is trained on the data. Second, the trained model is analyzed using the geometric median as a base: the redundant filters are identified and removed, which compresses the network further. Finally, the pruned model is fine-tuned to regain accuracy. Experiments on the CIFAR-10, CIFAR-100 and ILSVRC 2017 datasets show that the proposed mixed pruning approach is feasible and can compress the network to a greater extent without significant loss of accuracy. With VGG-16 on CIFAR-10, the numbers of operations and parameters are reduced by 18.56× and 3.33×, respectively, with almost 1% loss in accuracy. The compression rate for AlexNet on CIFAR-10 is 2.61× in number of operations and 4.85× in number of parameters, with a gain of 1.2% in accuracy. On CIFAR-100, VGG-19 is compressed by 16.02× in number of operations and 36× in number of parameters, with a 2.6% loss of accuracy. Similarly, the compression rate for VGG-19 on ILSVRC 2017 is 1.87× for operations and 2.4× for parameters, with a 0.5% loss in accuracy.
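The two analysis steps named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not state the variance threshold, how activations are collected, or the exact geometric-median criterion, so the function names, the 0.999 threshold, and the "smallest total distance to all other filters" approximation of the geometric median are all assumptions made for illustration.

```python
import numpy as np


def significant_filter_count(activations, variance_threshold=0.999):
    """Estimate how many filters a layer needs, using PCA.

    activations: array of shape (num_samples, num_filters), one row per
    sampled response of all filters in the layer (assumed layout).
    Returns the number of principal components needed to explain
    `variance_threshold` of the total variance (hypothetical default).
    """
    centered = activations - activations.mean(axis=0, keepdims=True)
    # Squared singular values of the centered data are proportional to
    # the eigenvalues of its covariance matrix, i.e. the PCA variances.
    singular_values = np.linalg.svd(centered, compute_uv=False)
    variances = singular_values ** 2
    total = variances.sum()
    if total == 0.0:
        return 1  # constant activations: keep a single filter
    cumulative = np.cumsum(variances) / total
    count = int(np.searchsorted(cumulative, variance_threshold)) + 1
    return min(count, cumulative.size)


def redundant_filter_indices(weights, num_to_prune):
    """Pick the filters closest to the geometric median of a layer.

    weights: array of shape (num_filters, ...), e.g. (out, in, kh, kw).
    Approximation (an assumption, not taken from the paper): the filters
    with the smallest summed Euclidean distance to all other filters are
    treated as nearest to the geometric median and therefore redundant.
    """
    flat = weights.reshape(weights.shape[0], -1)
    pairwise = np.linalg.norm(flat[:, None, :] - flat[None, :, :], axis=-1)
    total_distance = pairwise.sum(axis=1)
    return np.sort(np.argsort(total_distance)[:num_to_prune])


# Illustrative usage on synthetic data (not results from the paper).
rng = np.random.default_rng(0)
layer_activations = rng.normal(size=(500, 3)) @ rng.normal(size=(3, 8))
print("filters to keep:", significant_filter_count(layer_activations))

layer_weights = rng.normal(size=(8, 3, 3, 3))
print("filters to prune:", redundant_filter_indices(layer_weights, 2))
```

In this sketch the PCA step decides how wide each layer of the rebuilt network should be, while the geometric-median step is applied afterwards to the retrained weights. The pairwise distance matrix is quadratic in the number of filters, which is usually acceptable for a single convolutional layer.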

Highlights

Convolutional neural networks (CNNs) have achieved state-of-the-art performance in many applications such as face recognition [1], object detection [2], semantic segmentation [3] and other classification tasks.
Modern deep neural networks are computationally expensive and memory intensive, and they require substantial computational power for training and deployment. Bringing the advances in neural network technology to mobile devices has therefore become a challenge.
Much work in recent years has focused on reducing the size of pre-trained neural networks so that they can be deployed on mobile devices for inference [4, 5].

Summary

Introduction

Convolutional neural networks (CNNs) have achieved state-of-the-art performance in many applications such as face recognition [1], object detection [2], semantic segmentation [3] and other classification tasks. Much work in recent years has focused on reducing the size of pre-trained neural networks so that they can be deployed on mobile devices for inference [4, 5]. The latest architectures, such as those built on the inception module [6] or residual connections [7], have millions of parameters, which require extensive computation and storage. These architectures produce state-of-the-art accuracy, and most designers start with pre-trained networks for transfer learning purposes. These networks are rarely evaluated on the given datasets; only the classifier is trained and fine-tuned. It is of great importance to devise deep neural network models with relatively low complexity and high accuracy.

Journal: PLOS ONE	Publication Date: Jan 24, 2022
Citations: 1	License type: CC BY 4.0

Similar Papers

Exploiting Retraining-Based Mixed-Precision Quantization for Low-Cost DNN Accelerator Design.
Nahsung Kim ... Wonseok Choi
IEEE Transactions on Neural Networks and Learning Systems | VOL. 32
03 Aug 2020

An information entropy-based filter pruning method for efficient ConvNets
Dongsheng Li ... Ye Zhao
08 May 2023

StructADMM: Achieving Ultrahigh Efficiency in Structured Pruning for DNNs
Tianyun Zhang ... Xue Lin
IEEE Transactions on Neural Networks and Learning Systems | VOL. 33
01 May 2022

Methodology to Adapt Neural Network on Constrained Device at Topology level
Logan Saint-Germain ... Christophe Jego
02 Nov 2022
