The Possibility of Combining and Implementing Deep Neural Network Compression Methods

Bratislav Predić,Darjan Karabašević,Muzafer Saračević,Uroš Vukić,Dragiša Stanujkić

doi:10.3390/axioms11050229

Bratislav Predić, Darjan Karabašević + Show 3 more

Open Access

https://doi.org/10.3390/axioms11050229

Copy DOI

Abstract

In the paper, the possibility of combining deep neural network (DNN) model compression methods to achieve better compression results was considered. To compare the advantages and disadvantages of each method, all methods were applied to the ResNet18 model for pretraining to the NCT-CRC-HE-100K dataset while using CRC-VAL-HE-7K as the validation dataset. In the proposed method, quantization, pruning, weight clustering, QAT (quantization-aware training), preserve cluster QAT (hereinafter PCQAT), and distillation were performed for the compression of ResNet18. The final evaluation of the obtained models was carried out on a Raspberry Pi 4 device using the validation dataset. The greatest model compression result on the disk was achieved by applying the PCQAT method, whose application led to a reduction in size of the initial model by as much as 45 times, whereas the greatest model acceleration result was achieved via distillation on the MobileNetV2 model. All methods led to the compression of the initial size of the model, with a slight loss in the model accuracy or an increase in the model accuracy in the case of QAT and weight clustering. INT8 quantization and knowledge distillation also led to a significant decrease in the model execution time.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Axioms	Publication Date: May 13, 2022
Citations: 12	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

The Possibility of Combining and Implementing Deep Neural Network Compression Methods

Abstract

Talk to us

Similar Papers

More From: Axioms

Lead the way for us

Similar Papers

A comparative evaluation of deep convolutional neural network and deep neural network-based land use/land cover classifications of mining regions using fused multi-sensor satellite data
Ajay Kumar ... Amit Kumar Gorai
Advances in Space Research | VOL. 72
Ajay Kumar, et. al.Ajay Kumar ... Amit Kumar Gorai
04 Sep 2023
Advances in Space Research | VOL. 72

High-performance and energy-efficient deep learning for resource-constrained devices
Ao Ren
-
Ao RenAo Ren
10 May 2021
10 May 2021

Object classification and visualization with edge artificial intelligence for a customized camera trap platform
Sajid Nazir ... Mohammad Kaleem
Ecological Informatics | VOL. 79
Sajid Nazir, et. al.Sajid Nazir ... Mohammad Kaleem
02 Jan 2024
Ecological Informatics | VOL. 79

Using Distillation to Improve Network Performance after Pruning and Quantization
Zhenshan Bao ... Jiayang Liu
-
Zhenshan Bao, et. al.Zhenshan Bao ... Jiayang Liu
18 Sep 2019
18 Sep 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The Possibility of Combining and Implementing Deep Neural Network Compression Methods

Abstract

Talk to us

Similar Papers

More From: Axioms