Abstract

Precision-scalable deep neural network (DNN) accelerator designs have attracted considerable research interest. Since the computation of most DNNs is dominated by multiply-accumulate (MAC) operations, designing efficient precision-scalable MAC (PSMAC) units is of central importance. This brief proposes two low-complexity PSMAC unit architectures based on the well-known Fusion Unit (FU), which is composed of basic units called Bit Bricks (BBs). We first simplify the BB architecture by optimizing away redundant logic. We then devise a top-level PSMAC unit architecture that employs BBs recursively. On this basis, two low-complexity PSMAC unit architectures are presented for two different kinds of quantization schemes. Moreover, we provide insight into the decomposed multiplications and further reduce the bitwidths of both architectures. Experimental results show that the proposed architectures save up to 44.18% in area cost and 45.45% in power consumption compared with the state-of-the-art design.
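The decomposition idea behind FU-style PSMAC units can be illustrated in software: a wide multiplication is expressed as a sum of shifted products of small fixed-width chunks, so the same small multipliers can be fused for higher precision or used independently at lower precision. The sketch below is a hedged behavioral model only, not the paper's circuit; the function names, the 2-bit brick width, and the unsigned-operand assumption are illustrative choices, not details taken from the brief.

```python
# Behavioral sketch (assumption: unsigned operands, 2-bit bricks) of a
# bit-brick-decomposed multiply, in the spirit of Fusion-Unit-style
# precision-scalable MAC units. All names here are illustrative.

def bit_bricks(x, width, brick=2):
    """Split an unsigned `width`-bit integer into little-endian
    `brick`-bit chunks (the 'bit bricks')."""
    mask = (1 << brick) - 1
    return [(x >> (i * brick)) & mask for i in range(width // brick)]

def decomposed_multiply(a, b, width, brick=2):
    """Multiply by accumulating shifted brick-pair partial products.

    Each pair (a_i, b_j) contributes a_i * b_j << brick*(i + j),
    mirroring how an array of small multipliers can be fused to
    reach higher precision.
    """
    acc = 0
    for i, ai in enumerate(bit_bricks(a, width, brick)):
        for j, bj in enumerate(bit_bricks(b, width, brick)):
            acc += (ai * bj) << (brick * (i + j))
    return acc

# 8-bit operands decompose into four 2-bit bricks each,
# yielding 16 partial products that sum to the exact product.
assert decomposed_multiply(173, 201, width=8) == 173 * 201
```

At lower precision the same bricks simply operate on narrower operands (e.g. `width=4` uses only two bricks per operand), which is the scalability the brief's architectures exploit in hardware.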
