Quantizing Neural Networks for Low-Power Computer Vision

Marios Fournarakis, Mart Van Baalen, Yelysei Bondarenko, Rana Ali Amjad, Markus Nagel, Tijmen Blankevoort

doi:10.1201/9781003162810-11

Abstract

In recent years, Neural Networks (NNs) have been widely adopted in Computer Vision (CV) applications. While they outperform traditional CV algorithms on many tasks, they often come at a high compute cost. Even mobile-friendly architectures such as MobileNet still require hundreds of millions of floating-point operations. To further improve the energy efficiency and reduce the latency of NNs, quantization can be used to replace the original floating-point operations with low-bit fixed-point operations. In this chapter, we introduce NN quantization for low-power computer vision. Afterward, we highlight recent advances in post-training quantization, a class of algorithms that can be applied to pretrained NNs and do not require any expert knowledge. In the last part, we focus on quantization-aware training, a technique that trains NNs with simulated quantization operations.

Take-aways

- Introduces neural network quantization.
- Serves as a practical guide to quantization simulation with hardware (HW) considerations.
- Introduces state-of-the-art post-training quantization (PTQ) techniques that are easy to use.
- Introduces state-of-the-art quantization-aware training (QAT) approaches that result in the best performance.
- Defines standard PTQ and QAT pipelines and evaluates them on several computer vision models and tasks.
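To make the idea of simulated quantization concrete, here is a minimal sketch of a quantize-dequantize ("fake quantization") step in NumPy. It is an illustration under stated assumptions, not the chapter's own implementation: the function name `fake_quantize`, the `num_bits` parameter, and the choice of a simple min-max range with an asymmetric (affine) integer grid are all hypothetical choices made for this example.

```python
import numpy as np

def fake_quantize(x, num_bits=8):
    """Simulate uniform affine quantization of a float array.

    Assumption: the quantization range is taken from the min/max of x,
    which is only one of several possible range-setting choices.
    """
    qmin, qmax = 0, 2 ** num_bits - 1

    # Extend the range to include zero so that 0.0 maps exactly to an integer.
    x_min = min(float(x.min()), 0.0)
    x_max = max(float(x.max()), 0.0)

    # Step size between adjacent integer levels; guard against an all-zero input.
    scale = (x_max - x_min) / (qmax - qmin)
    if scale == 0.0:
        scale = 1.0

    # Integer that represents the real value 0.0.
    zero_point = int(round(-x_min / scale))

    # Quantize: scale, round to the nearest integer, shift, clamp to the grid.
    x_int = np.clip(np.round(x / scale) + zero_point, qmin, qmax)

    # Dequantize: map the integers back to (approximate) real values.
    x_hat = (x_int - zero_point) * scale
    return x_hat, x_int.astype(np.int64), scale, zero_point


if __name__ == "__main__":
    weights = np.linspace(-1.0, 2.0, 7)
    w_hat, w_int, scale, zero_point = fake_quantize(weights, num_bits=8)
    print("integers:   ", w_int)
    print("dequantized:", w_hat)
    print("max error:  ", np.max(np.abs(weights - w_hat)))
```

In this sketch the rounding error for values inside the range stays within about half a step (`scale / 2`). Running a network with such quantize-dequantize steps inserted is one common way to estimate on floating-point hardware how a fixed-point deployment would behave.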


