Abstract

The deep neural network (DNN) has achieved remarkable performance in a wide range of applications at the cost of a large memory footprint and high computational complexity. Fixed-point network quantization has emerged as a popular acceleration and compression method, but it still suffers from severe performance degradation when extremely low-bit quantization is used. Moreover, current fixed-point quantization methods rely heavily on supervised retraining with large amounts of labeled training data, which is hard to obtain in real-world applications. In this article, we propose an efficient framework, namely, the fixed-point factorized network (FFN), to turn all weights into ternary values, i.e., {-1, 0, 1}. We highlight that the proposed FFN framework can achieve negligible accuracy degradation even without any supervised retraining on labeled data. Note that the activations can be easily quantized into an 8-bit format; thus, the resulting networks only require low-bit fixed-point additions, which are significantly more efficient than 32-bit floating-point multiply-accumulate operations (MACs). Extensive experiments on large-scale ImageNet classification and object detection on MS COCO show that the proposed FFN achieves more than 20× compression and removes most of the multiply operations with comparable accuracy. Code is available on GitHub at https://github.com/wps712/FFN.
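
The pipeline described above (ternary weights combined with 8-bit activations) can be illustrated with a minimal sketch. Note that this sketch uses a generic threshold-based ternarizer and per-tensor uniform activation quantization; it is not the fixed-point factorization algorithm of FFN itself, and all function names and parameters below are hypothetical illustrations.

```python
import numpy as np

def ternarize_weights(W, delta_ratio=0.7):
    """Map a float weight tensor to {-1, 0, +1} with a per-tensor scale.

    Uses a common threshold heuristic (threshold = delta_ratio * mean(|W|));
    this is only a stand-in to show the ternary data format, not FFN's
    factorization.
    """
    delta = delta_ratio * np.mean(np.abs(W))
    T = np.zeros_like(W, dtype=np.int8)
    T[W > delta] = 1
    T[W < -delta] = -1
    mask = T != 0
    # Scale that minimizes the L2 error over the non-zero entries.
    alpha = np.abs(W[mask]).mean() if mask.any() else 0.0
    return T, alpha

def quantize_activations_uint8(x):
    """Uniform per-tensor 8-bit quantization of activations."""
    lo, hi = x.min(), x.max()
    scale = (hi - lo) / 255.0 if hi > lo else 1.0
    q = np.round((x - lo) / scale).astype(np.uint8)
    return q, scale, lo

# Example: a ternary "layer" only needs integer additions/subtractions;
# the float scales are folded in once per output.
W = np.random.randn(64, 128).astype(np.float32)
x = np.random.rand(128).astype(np.float32)
T, alpha = ternarize_weights(W)
q, scale, offset = quantize_activations_uint8(x)
y = alpha * scale * (T.astype(np.int32) @ q.astype(np.int32)) \
    + alpha * offset * T.sum(axis=1)   # approximates W @ x
```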
