Quantization-Based Optimization Algorithm for Hardware Implementation of Convolution Neural Networks

Bassam J. Mohd,Anas AlMajali,Thaier Hayajneh,Khalil M. Ahmad Yousef

doi:10.3390/electronics13091727

Bassam J. Mohd, Anas AlMajali + Show 2 more

Open Access

https://doi.org/10.3390/electronics13091727

Copy DOI

Journal: Electronics	Publication Date: Apr 30, 2024
License type: CC BY 4.0

Affiliation: Hashemite University, Fordham University

Abstract

Convolutional neural networks (CNNs) have demonstrated remarkable performance in many areas but require significant computation and storage resources. Quantization is an effective method to reduce CNN complexity and implementation. The main research objective is to develop a scalable quantization algorithm for CNN hardware design and model the performance metrics for the purpose of CNN implementation in resource-constrained devices (RCDs) and optimizing layers in deep neural networks (DNNs). The algorithm novelty is based on blending two quantization techniques to perform full model quantization with optimum accuracy, and without additional neurons. The algorithm is applied to a selected CNN model and implemented on an FPGA. Implementing CNN using broad data is not possible due to capacity issues. With the proposed quantization algorithm, we succeeded in implementing the model on the FPGA using 16-, 12-, and 8-bit quantization. Compared to the 16-bit design, the 8-bit design offers a 44% decrease in resource utilization, and achieves power and energy reductions of 41% and 42%, respectively. Models show that trading off one quantization bit yields savings of approximately 5.4K LUTs, 4% logic utilization, 46.9 mW power, and 147 μJ energy. The models were also used to estimate performance metrics for a sample DNN design.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Quantization-Based Optimization Algorithm for Hardware Implementation of Convolution Neural Networks

Abstract

Talk to us

Similar Papers

More From: Electronics

Lead the way for us

Similar Papers

Artificial intelligence: finding the intersection of predictive modeling and clinical utility
Karthik Ravi
Gastrointestinal Endoscopy | VOL. 93
Karthik RaviKarthik Ravi
07 Mar 2021
Gastrointestinal Endoscopy | VOL. 93

Tunnel boring machine vibration-based deep learning for the ground identification of working faces
Mengbo Liu ... Yanqing Men
Journal of Rock Mechanics and Geotechnical Engineering | VOL. 13
Mengbo Liu, et. al.Mengbo Liu ... Yanqing Men
01 Dec 2021
Journal of Rock Mechanics and Geotechnical Engineering | VOL. 13

Aspects of programming for implementation of convolutional neural networks on multisystem HPC architectures
Sunil Pandey ... Shrish Verma
Journal of Physics: Conference Series | VOL. 2062
Sunil Pandey, et. al.Sunil Pandey ... Shrish Verma
01 Nov 2021
Journal of Physics: Conference Series | VOL. 2062

RS-DeepSuperLearner: fusion of CNN ensemble for remote sensing scene classification
Haikel Alhichri
Annals of GIS | VOL. 29
Haikel AlhichriHaikel Alhichri
02 Jan 2023
Annals of GIS | VOL. 29

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Quantization-Based Optimization Algorithm for Hardware Implementation of Convolution Neural Networks

Abstract

Talk to us

Similar Papers

More From: Electronics