Abstract
Deep neural networks (DNNs) have achieved state-of-the-art performance in many domains but suffer from high computational and memory complexity. Network quantization can effectively reduce computation and memory costs without changing the network structure, facilitating the deployment of DNNs on mobile devices. While existing methods can obtain good performance, low-bit quantization without time-consuming training or access to the full dataset remains a challenging problem. In this paper, we develop a novel method, Compressor-based Non-uniform Quantization (CNQ), which achieves non-uniform quantization of DNNs with only a few unlabeled samples. First, we present a compressor-based fast non-uniform quantization method that accomplishes non-uniform quantization without iterations. Second, we propose to align the feature maps of the quantized model with those of the pre-trained model for accuracy recovery. Considering the differences between activation channels, we use a per-channel weighted entropy to optimize the alignment loss. In the experiments, we evaluate the proposed method on image classification and object detection. Our results outperform existing post-training quantization methods, demonstrating the effectiveness of the proposed method.
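The abstract leaves both components at a high level. As a minimal sketch, assuming a mu-law companding function as the compressor and a normalized-activation entropy as the per-channel weight (both are illustrative assumptions, not the paper's specification), the two ideas could look as follows in NumPy:

import numpy as np

def mu_law_quantize(w, n_bits=4, mu=255.0):
    """Non-uniform quantization via a compressor function (illustrative sketch).

    Assumption: the paper does not specify its compressor in the abstract;
    mu-law companding is used here as a stand-in. The compressor allocates
    finer resolution near zero, where DNN weights concentrate, so a uniform
    grid in the compressed domain becomes a non-uniform grid in the weight
    domain. Everything is closed-form, i.e. no iterative optimization.
    """
    s = np.max(np.abs(w)) + 1e-12                 # per-tensor scale
    x = w / s                                     # normalize to [-1, 1]
    # Compress: C(x) = sign(x) * ln(1 + mu|x|) / ln(1 + mu)
    c = np.sign(x) * np.log1p(mu * np.abs(x)) / np.log1p(mu)
    # Uniformly quantize in the compressed domain
    levels = 2 ** (n_bits - 1) - 1
    q = np.round(c * levels) / levels
    # Expand back: C^{-1}(y) = sign(y) * ((1 + mu)^{|y|} - 1) / mu
    x_hat = np.sign(q) * np.expm1(np.abs(q) * np.log1p(mu)) / mu
    return x_hat * s

For accuracy recovery, the alignment loss can be sketched as a per-channel weighted distance between the quantized model's feature maps and the pre-trained model's; the entropy-based weighting below is an assumed proxy, since the abstract does not define the weighted entropy:

def weighted_alignment_loss(fq, fp, eps=1e-12):
    """Per-channel weighted feature-map alignment (illustrative sketch).

    fq, fp: feature maps of the quantized and pre-trained models,
    shape (N, C, H, W). Channels whose full-precision activations carry
    higher (assumed) entropy receive larger alignment weight.
    """
    c = fp.shape[1]
    a = np.abs(fp).transpose(1, 0, 2, 3).reshape(c, -1)
    p = a / (a.sum(axis=1, keepdims=True) + eps)   # per-channel distribution
    ent = -(p * np.log(p + eps)).sum(axis=1)       # per-channel entropy
    wgt = ent / (ent.sum() + eps)                  # normalized weights
    se = ((fq - fp) ** 2).transpose(1, 0, 2, 3).reshape(c, -1).mean(axis=1)
    return (wgt * se).sum()

# Usage on random data, only to show the shapes involved:
w_q = mu_law_quantize(np.random.randn(64, 3, 3, 3).astype(np.float32) * 0.1)
fp = np.random.randn(8, 16, 14, 14).astype(np.float32)
fq = fp + 0.01 * np.random.randn(*fp.shape).astype(np.float32)
loss = weighted_alignment_loss(fq, fp)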