A Precision-Scalable Energy-Efficient Convolutional Neural Network Accelerator

Wenjian Liu, Zhongfeng Wang, Jun Lin

doi:10.1109/tcsi.2020.2993051

Abstract

Quantization is a promising technique for compressing Convolutional Neural Network (CNN) models. Recently, various precision-scalable designs have been presented to reduce the computational complexity of CNNs. However, most of them adopt a straightforward calculation scheme to implement the CNN, which leads to high bandwidth requirements and low hardware utilization. This paper proposes a new precision-scalable architecture that reduces the computational complexity of CNN inference while using a simplified calculation scheme. Based on the proposed scheme, a well-optimized multiplier called the Compositional Processing Element (C-PE) is devised. Compared with previous multipliers, the new C-PE requires less area and power. Furthermore, two levels of optimization are introduced to relieve the bandwidth problem and increase hardware utilization. Implemented in TSMC 90 nm CMOS technology, the whole design achieves 6-68.1 fps at various precisions on the VGG16 benchmark and an energy efficiency of 49.8 TOPS/W at 500 MHz when scaled to 28 nm, which is substantially better than previous precision-scalable designs.
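
As a rough illustration of what "precision-scalable" can mean for a multiplier, the sketch below composes one unsigned 8-bit product from four 4-bit products. This is a generic textbook decomposition, assumed here only for illustration; it is not the paper's C-PE, whose internal structure the abstract does not describe. The function names (`split_nibbles`, `mul4`, `composed_mul8`) are hypothetical.

```python
from typing import Tuple


def split_nibbles(x: int) -> Tuple[int, int]:
    """Split an unsigned 8-bit value into its (high, low) 4-bit halves."""
    if not 0 <= x <= 0xFF:
        raise ValueError("expected an unsigned 8-bit value")
    return x >> 4, x & 0xF


def mul4(a: int, b: int) -> int:
    """Stand-in for one low-precision (4-bit x 4-bit) multiplier unit."""
    if not (0 <= a <= 0xF and 0 <= b <= 0xF):
        raise ValueError("expected unsigned 4-bit operands")
    return a * b


def composed_mul8(a: int, b: int) -> int:
    """Build an 8-bit x 8-bit product from four 4-bit partial products.

    a * b = (a_hi * b_hi << 8) + ((a_hi * b_lo + a_lo * b_hi) << 4) + a_lo * b_lo
    """
    a_hi, a_lo = split_nibbles(a)
    b_hi, b_lo = split_nibbles(b)
    high = mul4(a_hi, b_hi) << 8
    cross = (mul4(a_hi, b_lo) + mul4(a_lo, b_hi)) << 4
    low = mul4(a_lo, b_lo)
    return high + cross + low
```

In such a scheme the same four 4-bit units could instead compute four independent 4-bit products per cycle, which is the general sense in which lower precision buys throughput; how the paper's design actually arranges this is not stated in the abstract.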

Journal: IEEE Transactions on Circuits and Systems I: Regular Papers	Publication Date: Oct 1, 2020
Citations: 39

Similar Papers

Characterizing Memory Access Patterns of Various Convolutional Neural Networks for Utilizing Processing-in-Memory
Jihoon Jang ... Hyokeun Lee
05 Feb 2023

An Efficient Design Flow for Accelerating Complicated-connected CNNs on a Multi-FPGA Platform
Deguang Wang ... Junzhong Shen
05 Aug 2019

Photonic Reconfigurable Accelerators for Efficient Inference of CNNs With Mixed-Sized Tensors
Sairam Sri Vatsavai ... Ishan G Thakkar
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | VOL. 41
01 Nov 2022

A Survey of Convolutional Neural Networks on Edge with Reconfigurable Computing
Mário P Véstias
Algorithms | VOL. 12
31 Jul 2019
