Quantization of Deep Neural Networks for Accurate Edge Computing

Wentao Chen,Xiaowe Xu,Tianchen Wang,Meiping Huang,Chutong Zhang,Hailong Qiu,Qing Lu,Yiyu Shi,Yu Hu,Jian Zhuang

doi:10.1145/3451211

Abstract

Deep neural networks have demonstrated their great potential in recent years, exceeding the performance of human experts in a wide range of applications. Due to their large sizes, however, compression techniques such as weight quantization and pruning are usually applied before they can be accommodated on the edge. It is generally believed that quantization leads to performance degradation, and plenty of existing works have explored quantization strategies aiming at minimum accuracy loss. In this paper, we argue that quantization, which essentially imposes regularization on weight representations, can sometimes help to improve accuracy. We conduct comprehensive experiments on three widely used applications: fully connected network for biomedical image segmentation, convolutional neural network for image classification on ImageNet, and recurrent neural network for automatic speech recognition, and experimental results show that quantization can improve the accuracy by 1%, 1.95%, 4.23% on the three applications respectively with 3.5x-6.4x memory reduction.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Quantization of Deep Neural Networks for Accurate Edge Computing

Abstract

Talk to us

Similar Papers

More From: ACM Journal on Emerging Technologies in Computing Systems

Lead the way for us

Journal: ACM Journal on Emerging Technologies in Computing Systems	Publication Date: Jun 30, 2021
Citations: 8

Similar Papers

Hybrid Hidden Markov Model and Artificial Neural Network for Automatic Speech Recognition
Xian Tang
-
Xian TangXian Tang
01 May 2009
01 May 2009

Robust quantization of deep neural networks
Youngseok Kim ... Jiwon Seo
-
Youngseok Kim, et. al.Youngseok Kim ... Jiwon Seo
22 Feb 2020
22 Feb 2020

Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition
Tara N Sainath ... Kevin W Wilson
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 25
Tara N Sainath, et. al.Tara N Sainath ... Kevin W Wilson
01 May 2017
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 25

Optimization of Deep Neural Network for Automatic Speech Recognition
Aqbal Waris ... R.K Aggarwal
-
Aqbal Waris, et. al.Aqbal Waris ... R.K Aggarwal
01 Jul 2018
01 Jul 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Quantization of Deep Neural Networks for Accurate Edge Computing

Abstract

Talk to us

Similar Papers

More From: ACM Journal on Emerging Technologies in Computing Systems