Abstract
Despite the remarkable advances that Convolutional Neural Networks have achieved on various intelligence tasks, their massive computation and storage consumption limits applications on resource-constrained devices. Existing works explore reducing computation cost by exploiting input-dependent redundancy at runtime. The irregular distribution of dynamic sparsity, however, limits the real speedup achievable when dynamic models are deployed on traditional neural network accelerators. To solve this problem, we propose an algorithm-architecture co-design, named structured precision skipping (SPS), to exploit the dynamic precision redundancy in statically quantized models. SPS computes most neurons at a lower precision and only a small portion of important neurons at a higher precision to preserve accuracy. Specifically, we first propose the structured dynamic block to exploit dynamic sparsity in a structured manner. Based on this block, we then apply a budget-aware training method that introduces a budget regularization term to learn precision skipping under a target resource constraint. Finally, we present an architecture design based on the bit-serial architecture with support for SPS models, where only a prediction controller module with small overhead is added. Extensive evaluation results demonstrate that SPS can achieve up to 1.5× speedup and 1.4× energy saving on various models and datasets with marginal accuracy loss.
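To make the core idea concrete, below is a minimal, illustrative Python/NumPy sketch of structured precision skipping for a single fully connected layer. It is not the authors' implementation: the block size, high-precision ratio, bit-widths, and the magnitude-based importance proxy are all assumptions standing in for the learned, budget-regularized skipping and the bit-serial hardware described in the paper.

```python
import numpy as np

def quantize(x, bits):
    """Symmetric per-tensor uniform quantization to the given bit-width."""
    scale = np.max(np.abs(x)) / (2 ** (bits - 1) - 1) + 1e-12
    return np.round(x / scale) * scale

def sps_linear(x, w, block_size=16, high_ratio=0.1, low_bits=4, high_bits=8):
    """Illustrative structured precision skipping for a fully connected layer.

    Output neurons are grouped into contiguous blocks of `block_size`.
    All blocks are first computed at `low_bits`; a cheap importance proxy
    (hypothetical here: mean low-precision magnitude) selects the top
    `high_ratio` fraction of blocks, which are recomputed at `high_bits`.
    """
    w_lo, w_hi = quantize(w, low_bits), quantize(w, high_bits)
    x_lo, x_hi = quantize(x, low_bits), quantize(x, high_bits)

    y = x_lo @ w_lo                                    # low-precision pass for all neurons
    n_blocks = y.shape[-1] // block_size
    blocks = y.reshape(-1, n_blocks, block_size)

    # Importance proxy: blocks with the largest low-precision magnitude.
    importance = np.abs(blocks).mean(axis=(0, 2))
    n_high = max(1, int(high_ratio * n_blocks))
    top = np.argsort(importance)[-n_high:]

    # Recompute only the selected important blocks at high precision.
    y_hi = x_hi @ w_hi
    blocks_hi = y_hi.reshape(-1, n_blocks, block_size)
    blocks[:, top, :] = blocks_hi[:, top, :]
    return blocks.reshape(y.shape)

# Toy usage
rng = np.random.default_rng(0)
x = rng.standard_normal((2, 64)).astype(np.float32)
w = rng.standard_normal((64, 128)).astype(np.float32)
print(sps_linear(x, w).shape)  # (2, 128)
```

In the paper, which blocks are promoted to high precision is learned during budget-aware training rather than chosen by a hand-crafted proxy, and the skipping is executed on a bit-serial accelerator so that low-precision blocks translate directly into fewer processed bit cycles.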