Training generalizable quantized deep neural nets

Charles Hernandez,Bijan Taslimi,Hung Yi Lee,Hongcheng Liu,Panos M Pardalos

doi:10.1016/j.eswa.2022.118736

Abstract

While a number of practical methods for training quantized DL models have been presented in the literature, there exists a critical gap in the theoretical generalizability results for such approaches. Although empirical evidence often suggests a high tolerance of DL architectures to variations of training procedures, existing theoretical generalization analyses are often contingent on the specific designs of training algorithms, e.g., in stochastic gradient descent (SGD). This specialization makes such generalizability results inapplicable to the case of quantized DL models. In view of this critical vacuum, this paper provides several almost-algorithm-independent results to ensure the generalizability of a quantized neural network at different levels of optimality. These results include the characterizations of a computable, quantized local solution that ensures the generalization performance and an algorithm that is provably convergent to such a local solution.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Training generalizable quantized deep neural nets

Abstract

Talk to us

Similar Papers

More From: Expert Systems With Applications

Lead the way for us

Similar Papers

Stochastic Natural Gradient Descent by estimation of empirical covariances
Luigi Malago ... Giovanni Pistone
-
Luigi Malago, et. al.Luigi Malago ... Giovanni Pistone
01 Jun 2011
01 Jun 2011

On the Insufficiency of Existing Momentum Schemes for Stochastic Optimization
Rahul Kidambi ... Prateek Jain
-
Rahul Kidambi, et. al.Rahul Kidambi ... Prateek Jain
01 Feb 2018
01 Feb 2018

The Implementation of Gradient Descent Based Methods Using Parallel Computing in R for Regression Tasks
Lala Septem Riza ... Muhammad Aziz Ashari
-
Lala Septem Riza, et. al.Lala Septem Riza ... Muhammad Aziz Ashari
01 Aug 2018
01 Aug 2018

To regularize or not: Revisiting SGD with simple algorithms and experimental studies
Wenwu He ... Yang Liu
Expert Systems with Applications | VOL. 112
Wenwu He, et. al.Wenwu He ... Yang Liu
15 Jun 2018
Expert Systems with Applications | VOL. 112

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Training generalizable quantized deep neural nets

Abstract

Talk to us

Similar Papers

More From: Expert Systems With Applications