A simple approach for quantizing neural networks

Johannes Maly,Rayan Saab

doi:10.1016/j.acha.2023.04.004

Abstract

In this short note, we propose a new method for quantizing the weights of a fully trained neural network. A simple deterministic pre-processing step allows us to quantize network layers via memoryless scalar quantization while preserving the network performance on given training data. On one hand, the computational complexity of this pre-processing slightly exceeds that of state-of-the-art algorithms in the literature. On the other hand, our approach does not require any hyper-parameter tuning and, in contrast to previous methods, allows a plain analysis. We provide rigorous theoretical guarantees in the case of quantizing single network layers and show that the relative error decays with the number of parameters in the network if the training data behave well, e.g., if it is sampled from suitable random distributions. The developed method also readily allows the quantization of deep networks by consecutive application to single layers.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A simple approach for quantizing neural networks

Abstract

Talk to us

Similar Papers

More From: Applied and Computational Harmonic Analysis

Lead the way for us

Journal: Applied and Computational Harmonic Analysis	Publication Date: Sep 1, 2023
Citations: 1

Similar Papers

Comparative Analysis of Neural Network and Linear Regression Applied to Black Friday Data

-

27 Jan 2020
27 Jan 2020

Robustness analysis of neural networks with an application to a neuro-controller problem
K Krishnakumar ... K Nichita
-
K Krishnakumar, et. al.K Krishnakumar ... K Nichita
29 Jul 1996
29 Jul 1996

Quantitative Service Reliability Assessment on Single and Multi Layer Networks
Harshit Pandey ... Cher Ming Tan
-
Harshit Pandey, et. al.Harshit Pandey ... Cher Ming Tan
01 Jan 2019
01 Jan 2019

CORR Synthesis: When Should the Orthopaedic Surgeon Use Artificial Intelligence, Machine Learning, and Deep Learning?
Michael P Murphy ... Nicholas M Brown
Clinical orthopaedics and related research | VOL. 479
Michael P Murphy, et. al.Michael P Murphy ... Nicholas M Brown
17 Feb 2021
Clinical orthopaedics and related research | VOL. 479

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A simple approach for quantizing neural networks

Abstract

Talk to us

Similar Papers

More From: Applied and Computational Harmonic Analysis