Abstract

Driven by the need to compress neural network (NN) weights, which is especially beneficial for edge devices with constrained resources, and by the need to use the simplest possible quantization model, in this paper we study the performance of three-bit post-training uniform quantization. The goal is to gather various choices of the key parameter of the quantizer in question (the support region threshold) in one place and to provide a detailed overview of the impact of this choice on the performance of post-training quantization for the MNIST dataset. Specifically, we analyze whether the accuracy of two NN models (an MLP and a CNN) can be preserved to a great extent with the very simple three-bit uniform quantizer, regardless of the choice of the key parameter. Moreover, we aim to answer whether it is of the utmost importance in post-training three-bit uniform quantization, as it is in classical quantization, to determine the optimal support region threshold value of the quantizer in order to achieve some predefined accuracy of the quantized neural network (QNN). The results show that the choice of the support region threshold value of the three-bit uniform quantizer does not have a strong impact on the accuracy of the QNNs, unlike in two-bit uniform post-training quantization applied in the MLP for the same classification task. Accordingly, one can anticipate that, owing to this special property, the post-training quantization model in question can be widely exploited.
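To make the role of the support region threshold concrete, the following is a minimal sketch of a symmetric three-bit (eight-level, mid-rise) uniform quantizer as commonly defined in the quantization literature; it is an illustrative assumption, not the paper's exact implementation, and the function name and parameters are hypothetical. The threshold `t` sets the support region [-t, t]: weights are clipped to it and mapped to the midpoint of one of the 2^3 = 8 uniform cells.

```python
import numpy as np

def uniform_quantize(w, t, bits=3):
    """Symmetric mid-rise uniform quantizer (illustrative sketch).

    w    : array of weights to quantize
    t    : support region threshold; weights are clipped to [-t, t]
    bits : bit rate (3 bits -> 8 reconstruction levels)
    """
    n_levels = 2 ** bits          # 8 levels for three bits
    step = 2 * t / n_levels       # uniform step size over [-t, t]
    w_clipped = np.clip(w, -t, t)
    # index of the quantization cell each weight falls into
    idx = np.floor((w_clipped + t) / step)
    idx = np.clip(idx, 0, n_levels - 1)  # keep boundary value t in the top cell
    # reconstruct at cell midpoints
    return -t + (idx + 0.5) * step
```

Varying `t` trades off clipping distortion (small `t` truncates large weights) against granular distortion (large `t` widens the step size); the paper's finding is that at three bits the QNN accuracy is fairly insensitive to this trade-off.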

Highlights

  • Neural networks (NNs) have achieved remarkable success in a wide range of real-world applications

  • We have shown that when a three-bit uniform quantizer (UQ) is utilized for post-training quantization, the accuracies of two NNs (an MLP and a convolutional neural network (CNN)) that we have pretrained on the MNIST dataset can be preserved for various choices of the key parameter of the quantizer in question

  • We have shown that in post-training three-bit uniform quantization, for both NN models (MLP and CNN) and for two datasets (MNIST and Fashion-MNIST), it is not of utmost importance, as it is in classical quantization, to determine the optimal support region threshold value of the UQ to achieve some predefined accuracy of the quantized neural network (QNN)


Introduction

Neural networks (NNs) have achieved remarkable success in a wide range of real-world applications. However, their application may be limited or impeded on edge devices with constrained resources, such as IoT and mobile devices [1,2,3,4,5,6]. On such resource-constrained devices, decreased storage and/or computational costs for NNs are indispensable, yet the accuracy of an NN can be severely degraded if the pathway toward this decrease is not chosen prudently [2,4,6]. Edge computing extends the cloud, bringing it as close as possible to heterogeneous end devices or end users [1,3].

