Abstract
Quantizing the weights and activations of deep neural networks is essential for deploying them on resource-constrained devices or on cloud platforms for at-scale services. Although binarization is a special case of quantization, this extreme case often leads to training difficulties and necessitates specialized models and training methods. As a result, recent quantization methods do not support binarization, losing the most resource-efficient option, and quantized and binarized networks have remained distinct research areas. We examine binarization difficulties within a quantization framework and find that all we need to enable binary training is a symmetric quantizer, good initialization, and careful hyperparameter selection. These techniques also lead to substantial improvements in multi-bit quantization. We demonstrate our unified quantization framework, denoted UniQ, on the ImageNet dataset with various architectures such as ResNet-18, ResNet-34, and MobileNetV2. For multi-bit quantization, UniQ outperforms existing methods and achieves state-of-the-art accuracy. For binarization, the achieved accuracy is comparable to existing state-of-the-art methods even without modifying the original architectures.
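For concreteness, the sketch below shows the kind of symmetric uniform quantizer the abstract refers to, with a single scalar step size per layer and binarization as the one-bit special case. This is a minimal illustration under those assumptions, not the paper's exact formulation; the function name and signature are hypothetical.

import numpy as np

def symmetric_quantize(w, step, bits):
    # Uniform symmetric quantizer with a shared scalar step size.
    # Levels are integer multiples of `step` in the symmetric range
    # [-(2**(bits-1) - 1), ..., 2**(bits-1) - 1]; for bits == 1 this
    # reduces to binarization with the two levels {-step, +step}.
    if bits == 1:
        return np.where(w >= 0, step, -step)
    n = 2 ** (bits - 1) - 1          # symmetric clipping level, e.g. 7 for 4 bits
    return np.clip(np.round(w / step), -n, n) * step

# Example usage on a random weight tensor:
w = np.random.randn(64, 64) * 0.05
w_q4 = symmetric_quantize(w, step=0.02, bits=4)   # 4-bit symmetric quantization
w_q1 = symmetric_quantize(w, step=0.02, bits=1)   # binarization

Note that the symmetric level set deliberately drops the extra negative level that a two's-complement (asymmetric) range would provide, which is the distinction the summary below draws.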
Highlights
Deep neural networks have achieved tremendous success in various fields, including computer vision [31], natural language processing [52], and speech recognition [8], demonstrating unprecedented predictive performance.
To demonstrate the effectiveness of our proposed method, we evaluate it on the CIFAR-100 [30] and ImageNet [45] datasets.
The experimental results are compared with various recent works on multi-bit quantization and neural network binarization.
Summary
Deep neural networks have achieved tremendous success in various fields, including computer vision [31], natural language processing [52], and speech recognition [8], demonstrating unprecedented predictive performance. Signed integers are assumed to be represented in two's complement, which has an asymmetric range, whereas our framework uses a symmetric quantizer; our quantization method does not transform the weights of the pre-trained models. We propose an optimal, analytic initialization for the step sizes. According to our ablation study, the proposed framework shows significant improvements over prior works as a combined result of the symmetric quantizer and the optimal initialization. We scrutinize the training dynamics of the binary case and find that it receives strong gradient signals at the beginning of training and that the distribution of the quantizer input changes extremely fast compared to multi-bit cases. We hypothesize that this difference arises because the initial point after binarization is too far from the pre-trained solution. Our method does not modify the original architectures, meaning that it can be used in conjunction with network modification techniques.
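To make the idea of initializing step sizes from the pre-trained weights concrete, the sketch below picks the step size that minimizes the mean-squared quantization error by a simple scan over candidates. The scan, the candidate grid, and the MSE criterion are illustrative assumptions standing in for the paper's closed-form analytic rule, and the function name is hypothetical.

import numpy as np

def init_step_size(w, bits, num_candidates=200):
    # Illustrative MSE-based step-size initialization: evaluate a grid of
    # candidate step sizes and keep the one with the smallest quantization
    # error on the pre-trained weight tensor `w`.
    n = 1 if bits == 1 else 2 ** (bits - 1) - 1
    max_abs = np.abs(w).max()
    candidates = np.linspace(max_abs / (10.0 * n), max_abs / n, num_candidates)
    best_step, best_err = candidates[0], np.inf
    for step in candidates:
        if bits == 1:
            w_q = np.where(w >= 0, step, -step)          # binary levels {-step, +step}
        else:
            w_q = np.clip(np.round(w / step), -n, n) * step
        err = np.mean((w - w_q) ** 2)
        if err < best_err:
            best_step, best_err = step, err
    return best_step

Initializing the quantizer this way keeps the quantized network close to the pre-trained solution at the start of training, which is the failure mode the summary identifies for the binary case.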