HEMP: High-order entropy minimization for neural network compression

Enzo Tartaglione, Stéphane Lathuilière, Attilio Fiandrotti, Marco Cagnazzo, Marco Grangetto

doi:10.1016/j.neucom.2021.07.022

Open Access

https://doi.org/10.1016/j.neucom.2021.07.022

Abstract

We formulate the entropy of a quantized artificial neural network as a differentiable function that can be plugged in as a regularization term in the cost function minimized by gradient descent. Our formulation scales efficiently beyond the first order and is agnostic of the quantization scheme. The network can then be trained to minimize the entropy of the quantized parameters, so that they can be optimally compressed via entropy coding. We apply our entropy formulation to quantizing and compressing well-known network architectures over multiple datasets. Our approach compares favorably with similar methods: it benefits from a higher-order entropy estimate, is flexible towards non-uniform quantization (we use Lloyd-Max quantization), scales to any entropy order to be minimized, and is efficient in terms of compression. We show that HEMP can work in synergy with other approaches that prune or quantize the model itself, delivering significant benefits in storage size compressibility without harming the model’s performance.
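
As a rough illustration of the idea in the abstract, the sketch below estimates the entropy of quantized weights through soft (smooth) assignments to quantization levels, so that the estimate varies continuously with the weights. This is not the authors' formulation: the softmax relaxation, the `temperature` parameter, and all function names are assumptions made for this example. It uses plain Python; the same arithmetic written in an automatic-differentiation framework would in principle provide the gradients needed for a regularization term.

```python
import math

def soft_assign(w, centers, temperature=0.01):
    """Softmax over negative squared distances to each quantization level.
    A smooth stand-in for hard nearest-level quantization (assumed relaxation)."""
    logits = [-((w - c) ** 2) / temperature for c in centers]
    top = max(logits)
    exps = [math.exp(l - top) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]

def first_order_entropy(weights, centers, temperature=0.01):
    """Entropy in bits per symbol of the averaged soft assignments."""
    n = len(weights)
    probs = [0.0] * len(centers)
    for w in weights:
        for k, p in enumerate(soft_assign(w, centers, temperature)):
            probs[k] += p / n
    return -sum(p * math.log2(p) for p in probs if p > 0)

def conditional_entropy_order2(weights, centers, temperature=0.01):
    """H(current | previous) in bits, from soft joint counts of consecutive weights.
    Used here as a simple higher-order estimate (hypothetical helper)."""
    k = len(centers)
    assigns = [soft_assign(w, centers, temperature) for w in weights]
    n_pairs = len(weights) - 1
    joint = [[0.0] * k for _ in range(k)]
    for prev, cur in zip(assigns, assigns[1:]):
        for a in range(k):
            for b in range(k):
                joint[a][b] += prev[a] * cur[b] / n_pairs
    h = 0.0
    for a in range(k):
        row = sum(joint[a])
        for b in range(k):
            if joint[a][b] > 0:
                h -= joint[a][b] * math.log2(joint[a][b] / row)
    return h

# Toy example: weights that alternate between two of three levels.
centers = [-1.0, 0.0, 1.0]
weights = [-1.0, 1.0] * 4
h1 = first_order_entropy(weights, centers)
h2 = conditional_entropy_order2(weights, centers)
```

In this toy sequence each symbol is fully determined by the previous one, so the conditional estimate `h2` is far lower than the first-order estimate `h1`. That gap is the kind of redundancy a higher-order entropy estimate can expose and a first-order one cannot.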


Journal: Neurocomputing
Publication Date: Jul 10, 2021
Citations: 6

Similar Papers

GPU-Intensive Fast Entropy Coding Framework for Neural Image Compression
Hiroaki Akutsu ... Takahiro Naruko
05 Dec 2021

Gradient Descent for Non-convex Problems in Modern Machine Learning
27 Jun 2019

Neural networks based non-uniform scalar quantizer design with particle swarm optimization
Wenwei Zha ... G.K Venayagamoorthy
08 Jun 2005

Linearization of Non-Uniform Quantizers via Adaptive Non-Subtractive Dithering
Morriel Kasher ... Predrag Spasojevic
22 Mar 2023
