Abstract

Artificial intelligence at the edge is a growing research field. In this paper, we propose a novel re-encoding scheme for reducing the size of the weights of deep neural networks (DNNs). The proposed re-encoding scheme exploits the Booth encoding scheme and power-of-two (PO2) quantization to allow for very low-energy computation during neural network inference with minimal loss in classification accuracy. We demonstrate the advantages of the proposed re-encoding scheme by running a convolutional neural network (CNN) and a linear neural network on the proposed Extended Exact Multiplier and the proposed PO2 Multiplier. Our proposed PO2 quantization and re-encoding method reduce the model size of the CNN by 30.77% and that of the linear neural network by 49.86%. Furthermore, our multipliers reduce the inference energy of the CNN by 50.6% and of the linear neural network by 90.1%. The PO2 Multiplier is proposed for sensor-end computation of the linear neural network, with a 77.32% reduction in area relative to an exact Booth multiplier, and it reduces the inference energy consumption of the linear neural network by 93.2% compared to the unmodified exact multiplier. Our proposed scheme can be applied to most Booth multipliers to reduce inference energy consumption, requiring only minor modifications to the re-encoding signal arrangements. We also demonstrate that the proposed re-encoding scheme, paired with the proposed multipliers, outperforms existing designs in terms of resource utilization with minimal impact on the inference accuracy of the neural networks.
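
To illustrate the general idea behind PO2 quantization referenced in the abstract, the following is a minimal sketch (not the authors' implementation); it assumes weights are normalized to magnitudes at most 1 and uses a hypothetical po2_quantize helper. Each weight is replaced by a signed power of two, so a multiplication by the weight reduces to a sign flip and a shift of the activation.

import numpy as np

def po2_quantize(weights, num_bits=4):
    """Map each weight to sign(w) * 2**round(log2(|w|)), a hypothetical PO2 quantizer.

    Exponents are clipped to a small range representable in num_bits,
    assuming weight magnitudes are at most 1.
    """
    signs = np.sign(weights)
    magnitudes = np.abs(weights)
    nonzero = magnitudes > 0          # zero weights stay zero
    exponents = np.zeros_like(magnitudes)
    exponents[nonzero] = np.round(np.log2(magnitudes[nonzero]))
    exponents = np.clip(exponents, -(2 ** (num_bits - 1)), 0)
    quantized = signs * (2.0 ** exponents)
    quantized[~nonzero] = 0.0
    return quantized, exponents.astype(int)

# A PO2 multiply then reduces to a shift: for an integer activation x,
# y = x * w  becomes  y = sign(w) * (x >> -exponent).

In hardware, storing only the sign and the small exponent is what shrinks the weight memory, and the shift-based product is what lowers the multiplier energy; the paper's re-encoding additionally arranges these exponents to fit the Booth partial-product structure.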
