Intra-layer nonuniform quantization of convolutional neural network

Fangxuan Sun, Zhongfeng Wang, Jun Lin

doi:10.1109/wcsp.2016.7752720

Abstract

Deep convolutional neural networks (DCNNs) have achieved remarkable performance on object detection and speech recognition in recent years. However, the excellent performance of a DCNN comes at the cost of high computational complexity and large memory requirements. In this paper, an equal distance nonuniform quantization (ENQ) scheme and a K-means clustering nonuniform quantization (KNQ) scheme are proposed to reduce the required memory storage when low complexity hardware or software implementations are considered. For VGG-16 and AlexNet, the proposed nonuniform quantization schemes reduce the required memory storage by approximately 50% while achieving almost the same or even better classification accuracy compared to the state-of-the-art quantization method. Compared to the ENQ scheme, the proposed KNQ scheme provides a better tradeoff when higher accuracy is required.
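The abstract does not spell out how KNQ is built, so the following is only a generic sketch of the underlying idea: clustering the weights of one layer with K-means and storing a small codebook plus per-weight indices. It is not the authors' procedure. The function names (`assign`, `kmeans_quantize`, `mse`), the evenly spaced initialisation, the 16-level codebook, and the synthetic Gaussian "weights" are all assumptions made for illustration.

```python
import random


def assign(values, codebook):
    """Return, for each value, the index of the nearest codebook entry."""
    return [min(range(len(codebook)), key=lambda j: abs(v - codebook[j]))
            for v in values]


def kmeans_quantize(values, k, iters=20):
    """1-D Lloyd's algorithm (K-means) over scalar weights.

    Returns (codebook, indices). Centroids start evenly spaced over the
    value range, which is an assumed choice, not taken from the paper.
    """
    lo, hi = min(values), max(values)
    codebook = [lo + (hi - lo) * (i + 0.5) / k for i in range(k)]
    for _ in range(iters):
        indices = assign(values, codebook)
        for j in range(k):
            members = [v for v, idx in zip(values, indices) if idx == j]
            if members:  # leave empty clusters where they are
                codebook[j] = sum(members) / len(members)
    return codebook, assign(values, codebook)


def mse(values, codebook, indices):
    """Mean squared error between weights and their quantized values."""
    return sum((v - codebook[i]) ** 2
               for v, i in zip(values, indices)) / len(values)


# Synthetic stand-in for the weights of a single layer (hypothetical data).
random.seed(0)
weights = [random.gauss(0.0, 0.05) for _ in range(2000)]

k = 16
codebook, indices = kmeans_quantize(weights, k)

# Baseline: the evenly spaced starting codebook, i.e. uniform levels.
lo, hi = min(weights), max(weights)
uniform = [lo + (hi - lo) * (i + 0.5) / k for i in range(k)]
uniform_mse = mse(weights, uniform, assign(weights, uniform))
kmeans_mse = mse(weights, codebook, indices)

# Storage estimate: one index per weight plus a 32-bit float per level.
index_bits = (k - 1).bit_length()
quantized_bits = len(weights) * index_bits + k * 32
float_bits = len(weights) * 32

print(f"uniform MSE: {uniform_mse:.3e}  k-means MSE: {kmeans_mse:.3e}")
print(f"bits: {float_bits} -> {quantized_bits}")
```

Because Lloyd's algorithm never increases the distortion of its starting codebook, the clustered levels reconstruct the weights at least as accurately as the evenly spaced levels for the same index width. This is the general motivation for nonuniform levels; the storage figures printed here apply to this toy setup only and say nothing about the results reported in the paper.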

