Abstract

This paper considers the design of a binary scalar quantizer for a Laplacian source and its application in compressed neural networks. The quantizer's performance is investigated over a wide dynamic range of data variances, and for that purpose we derive novel closed-form expressions. Moreover, we propose two selection criteria for the variance range of interest. The binary quantizer is then applied to compress neural network weights, and its performance is analysed on a simple classification task. Good agreement between theory and experiment is observed, indicating strong potential for practical implementation.
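
As background for the abstract's claims, the standard mean-squared-error (MSE) design of a symmetric binary quantizer for a zero-mean Laplacian source can be sketched as follows; the paper's novel closed-form expressions extend this analysis to a wide range of variances and are not reproduced here. With threshold at zero and representation levels ±y1, the distortion is

\[
D(\sigma, y_1) = \mathrm{E}\!\left[(X - Q(X))^2\right] = \sigma^2 - 2\,y_1\,\mathrm{E}|X| + y_1^2 = \sigma^2 - \sqrt{2}\,\sigma\,y_1 + y_1^2 ,
\]

since E|X| = σ/√2 for a zero-mean Laplacian source with variance σ². Minimizing over y1 gives y1 = σ/√2, hence D_min = σ²/2 and a matched signal-to-quantization-noise ratio of 10·log10(2) ≈ 3.01 dB.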

Highlights

  • Artificial neural networks (NNs) have become an attractive research field in recent decades for resolving different challenges due to the increasing availability of powerful hardware [1]

  • The goal of the experimental section is to verify the theoretical analysis provided in Section III by applying the binary quantizer to the weights of a neural network

Summary

Introduction

Artificial neural networks (NNs) have become an attractive research field in recent decades for resolving a variety of challenges, owing to the increasing availability of powerful hardware [1]. The most significant achievements have been obtained in tasks such as image classification [2], object recognition [3], and speech processing [4]. NNs have also been applied in other fields, where promising results have been achieved [5]–[7]. The improved performance (i.e., a high prediction accuracy level) has often been obtained with very complex NN architectures that involve a large number of parameters and substantial computational and storage resources. This, in turn, can be a limiting factor for the application of NNs in portable and edge-computing devices with limited memory and processing power, or in latency-critical services. To reduce these costs, NN parameters (weights, activations, etc.), usually represented in 32-bit floating-point format (full precision), are mapped to fixed-point representations with lower bit lengths.
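
To make the quantization step concrete, below is a minimal sketch in Python (NumPy) of 1-bit weight quantization. It assumes, consistent with the derivation above, that the representation levels are the MSE-optimal ±σ/√2 for a zero-mean Laplacian weight distribution; the function name and interface are illustrative, not taken from the paper.

```python
import numpy as np

def binary_quantize(weights: np.ndarray) -> np.ndarray:
    """Map each weight to +/- y1, where y1 = sigma / sqrt(2) is the
    MSE-optimal level for a zero-mean Laplacian source (threshold at 0)."""
    sigma = weights.std()
    y1 = sigma / np.sqrt(2.0)
    return np.where(weights >= 0.0, y1, -y1)

# Example: quantize synthetic Laplacian "weights" and report the SQNR.
rng = np.random.default_rng(0)
w = rng.laplace(loc=0.0, scale=1.0 / np.sqrt(2.0), size=100_000)  # sigma ~ 1
wq = binary_quantize(w)
sqnr_db = 10.0 * np.log10(np.mean(w**2) / np.mean((w - wq)**2))
print(f"SQNR: {sqnr_db:.2f} dB")  # ~3.01 dB for a matched Laplacian source
```

Each quantized weight can then be stored as a single sign bit plus one shared scale y1 per tensor, which is the source of the compression discussed above.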

