Abstract

In recent years, model compression techniques have gained significant attention as a means of reducing the computational and memory requirements of deep neural networks. Knowledge distillation and pruning are two prominent approaches in this domain, each offering unique advantages for model efficiency. This paper investigates the combined effects of knowledge distillation and two pruning strategies, weight pruning and channel pruning, on compression efficiency and model performance. The study introduces a metric called “Performance Efficiency” to evaluate the impact of these pruning strategies on model compression and performance. Experiments are conducted on the CIFAR-10 and CIFAR-100 datasets across diverse model architectures, including ResNet, DenseNet, EfficientNet, and MobileNet. The results confirm the efficacy of both weight and channel pruning in achieving model compression; however, a clear distinction emerges, with weight pruning performing better across all four architecture types. We find that weight pruning adapts to knowledge distillation better than channel pruning does. Pruned models show a substantial reduction in parameter count with no significant loss in accuracy.
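To make the combination concrete, the sketch below pairs Hinton-style knowledge distillation with the two pruning strategies the paper compares, using PyTorch's `torch.nn.utils.prune` utilities. This is a minimal illustration under assumed settings: the teacher/student pairing (ResNet-50/ResNet-18), temperature, loss weight, and pruning ratios are placeholders, not the paper's reported configuration.

```python
# Minimal sketch: knowledge distillation combined with weight pruning
# (unstructured) and channel pruning (structured). All hyperparameters
# (T, alpha, amount) and the teacher/student pairing are illustrative
# assumptions, not values taken from the paper.
import torch
import torch.nn.functional as F
import torch.nn.utils.prune as prune
from torchvision.models import resnet18, resnet50

teacher = resnet50(num_classes=10).eval()  # assumed pretrained teacher
student = resnet18(num_classes=10)         # compact student for CIFAR-10

def distillation_loss(s_logits, t_logits, labels, T=4.0, alpha=0.7):
    """Soft-target KL term blended with the hard-label cross entropy."""
    soft = F.kl_div(
        F.log_softmax(s_logits / T, dim=1),
        F.softmax(t_logits / T, dim=1),
        reduction="batchmean",
    ) * (T * T)
    hard = F.cross_entropy(s_logits, labels)
    return alpha * soft + (1 - alpha) * hard

def weight_prune(model, amount=0.5):
    """Unstructured weight pruning: zero the smallest-magnitude weights."""
    for m in model.modules():
        if isinstance(m, torch.nn.Conv2d):
            prune.l1_unstructured(m, name="weight", amount=amount)
            prune.remove(m, "weight")  # bake the pruning mask into the weights

def channel_prune(model, amount=0.3):
    """Structured pruning: zero whole output channels, ranked by L2 norm."""
    for m in model.modules():
        if isinstance(m, torch.nn.Conv2d):
            prune.ln_structured(m, name="weight", amount=amount, n=2, dim=0)
            prune.remove(m, "weight")

# One distillation step (x, y = a batch of CIFAR images and labels):
#   with torch.no_grad():
#       t_logits = teacher(x)
#   loss = distillation_loss(student(x), t_logits, y)
#   loss.backward(); optimizer.step()
# Pruning is then applied to the distilled student, e.g. weight_prune(student),
# typically followed by a short fine-tuning phase to recover accuracy.
```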
