Abstract

As model predictions become more accurate and networks grow deeper, the memory consumed by neural networks becomes a problem, especially on mobile devices, where the tradeoff between computational cost and battery life is also difficult to balance and limits how much smarter these devices can become. Model quantization techniques offer a way to tackle this tradeoff by reducing memory bandwidth and storage requirements and improving system throughput and latency. This paper discusses and compares state-of-the-art neural network quantization methodologies, including Post-Training Quantization (PTQ) and Quantization-Aware Training (QAT). PTQ directly quantizes an already trained floating-point model; the implementation is simple and requires no quantization during the training phase. QAT inserts simulated quantization operations to model the effect of quantization during training, while the forward and backward passes are usually still performed in floating point. Finally, based on the experiments discussed in this paper, we conclude that as quantization techniques evolve, the accuracy gap between PTQ and QAT is shrinking.
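To make the two approaches concrete, the following is a minimal sketch of uniform affine quantization, assuming 8-bit integers and per-tensor scale and zero-point. The functions quantize, dequantize, calibrate, and fake_quantize are illustrative names, not from the paper: calibrate followed by quantize corresponds to the PTQ path of converting an already trained floating-point tensor, while fake_quantize (quantize then immediately dequantize) corresponds to the simulated quantization operation QAT inserts so that training still runs in floating point.

import numpy as np

def quantize(x, scale, zero_point, num_bits=8):
    # Map float values to integers: q = round(x / scale) + zero_point, clipped to the int range.
    qmin, qmax = 0, 2 ** num_bits - 1
    q = np.round(x / scale) + zero_point
    return np.clip(q, qmin, qmax).astype(np.int32)

def dequantize(q, scale, zero_point):
    # Map integers back to approximate float values.
    return scale * (q.astype(np.float32) - zero_point)

def calibrate(x, num_bits=8):
    # Derive scale/zero-point from the observed value range, as a simple PTQ calibration step might.
    qmin, qmax = 0, 2 ** num_bits - 1
    x_min, x_max = float(x.min()), float(x.max())
    scale = (x_max - x_min) / (qmax - qmin)
    zero_point = int(round(qmin - x_min / scale))
    return scale, zero_point

def fake_quantize(x, num_bits=8):
    # Quantize-then-dequantize: the simulated quantization used in QAT so the
    # forward/backward passes stay in floating point while seeing quantization error.
    scale, zero_point = calibrate(x, num_bits)
    return dequantize(quantize(x, scale, zero_point, num_bits), scale, zero_point)

weights = np.random.randn(4, 4).astype(np.float32)
print("max abs quantization error:", np.abs(weights - fake_quantize(weights)).max())

In an actual QAT setup the rounding step is typically bypassed in the backward pass (a straight-through estimator), which this sketch does not model.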
