Abstract

Parallel computing can speed up the training of deep learning models. In this paper, the classic ResNet architecture is chosen to test the effectiveness of data parallelism for image classification, and experimental results are reported for a 6-GPU environment. The paper argues that several factors should be weighed when building cost-effective hardware configurations to accelerate model training in practical application scenarios. Communication costs are not negligible, because today's large computing clusters are mainly provided through cloud computing. Another crucial point is that the number of CUDA cores is the primary hardware basis for GPU acceleration; consequently, for some graphics cards, larger video memory paired with fewer CUDA cores may not improve acceleration. In addition, model performance under parallel computing is an issue that cannot be disregarded. Because of the parallel strategy's limitations, hyperparameters must be tuned while speeding up model training; a lower learning rate and a smaller batch size are more likely to preserve the model's performance. The experimental conclusions of this paper can support the design of an appropriate hardware configuration scheme.
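To make the setting concrete, the following is a minimal sketch of data-parallel ResNet training on a single multi-GPU node, written in PyTorch. The framework, dataset (CIFAR-10), batch size, and learning rate are illustrative assumptions, not the paper's exact configuration; the structure only shows how each mini-batch is split across the available GPUs.

```python
# Minimal data-parallel ResNet training sketch (assumed PyTorch setup,
# not the paper's exact configuration).
import torch
import torch.nn as nn
from torch.utils.data import DataLoader
from torchvision import datasets, models, transforms


def main():
    device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

    # Standard ResNet-50 from torchvision stands in for the "classic" ResNet.
    model = models.resnet50(num_classes=10).to(device)

    # Replicate the model across all visible GPUs; each GPU processes a
    # slice of every mini-batch (data parallelism).
    if torch.cuda.device_count() > 1:
        model = nn.DataParallel(model)

    # CIFAR-10 is a placeholder image-classification workload.
    transform = transforms.Compose([transforms.Resize(224), transforms.ToTensor()])
    train_set = datasets.CIFAR10(root="./data", train=True, download=True,
                                 transform=transform)
    # The global batch size is divided among the GPUs by DataParallel.
    loader = DataLoader(train_set, batch_size=128, shuffle=True, num_workers=4)

    criterion = nn.CrossEntropyLoss()
    # A relatively low learning rate, in line with the abstract's note that
    # smaller learning rates and batch sizes help preserve model quality.
    optimizer = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)

    model.train()
    for epoch in range(2):  # short run for illustration only
        for images, labels in loader:
            images, labels = images.to(device), labels.to(device)
            optimizer.zero_grad()
            loss = criterion(model(images), labels)
            loss.backward()
            optimizer.step()
        print(f"epoch {epoch}: last batch loss {loss.item():.4f}")


if __name__ == "__main__":
    main()
```

In a multi-node or cloud-cluster setting of the kind the abstract discusses, `DistributedDataParallel` would typically replace `DataParallel`, and inter-GPU gradient synchronization is where the communication costs mentioned above arise.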
