Abstract

In deep learning, one popular subclass of the multilayer perceptron is the Convolutional Neural Network (CNN), optimized for tasks such as image classification and semantic segmentation. With a large number of floating-point operations and relatively little data transfer during the training phase, CNNs are well suited to parallel architectures. The high accuracy of CNNs, however, comes at the cost of substantial compute and memory demands. In this article, we present a comparative analysis of a pre-trained CNN for handwritten-digit recognition across different processing platforms. Studying the network's performance as a function of the underlying parallel platform offers insight into the parameters and factors that influence inference on state-of-the-art Graphics Processing Units (GPUs) and on systolic arrays such as Eyeriss and the Tensor Processing Unit (TPU). Through inference-time analysis, we observed that systolic arrays can run inference up to 58.7 times faster than a Turing-architecture GPU. We also show that while existing GPUs utilize their available resources more efficiently (up to 32%) than TPUs, the efficiency of application-specific systolic arrays can be on par with that of GPUs. Finally, we present results from three customized systolic-array-based platforms that designers can adopt when deciding on a hardware optimization goal.
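The comparison above rests on two measurable quantities: per-image inference time and resource-utilization efficiency, the latter commonly taken as achieved throughput divided by the platform's peak throughput. As a rough illustration of how such inference-time measurements are typically taken, the sketch below times a small MNIST-style CNN on whatever devices are available. This is a minimal sketch only: the SmallCNN architecture, the run counts, and the device list are illustrative assumptions, not the authors' model or hardware setup.

import time
import torch
import torch.nn as nn

class SmallCNN(nn.Module):
    """Hypothetical stand-in for the paper's pre-trained digit-recognition CNN."""
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 8, 3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),                      # 28x28 -> 14x14
            nn.Conv2d(8, 16, 3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),                      # 14x14 -> 7x7
        )
        self.classifier = nn.Linear(16 * 7 * 7, 10)

    def forward(self, x):
        return self.classifier(self.features(x).flatten(1))

def measure_latency(model, device, runs=100):
    """Average per-inference wall-clock time for a single MNIST-sized input."""
    model = model.to(device).eval()
    x = torch.randn(1, 1, 28, 28, device=device)  # one 28x28 grayscale digit
    with torch.no_grad():
        for _ in range(10):                       # warm-up iterations
            model(x)
        if device.type == "cuda":
            torch.cuda.synchronize()              # flush queued GPU work
        start = time.perf_counter()
        for _ in range(runs):
            model(x)
        if device.type == "cuda":
            torch.cuda.synchronize()
    return (time.perf_counter() - start) / runs

if __name__ == "__main__":
    model = SmallCNN()
    devices = ["cpu"] + (["cuda"] if torch.cuda.is_available() else [])
    for name in devices:
        ms = measure_latency(model, torch.device(name)) * 1e3
        print(f"{name}: {ms:.3f} ms per inference")

Batch size 1 is used here because single-sample latency is what matters for interactive inference; throughput comparisons of the kind reported in the article would instead sweep the batch size and divide by images processed.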
