Performance Evaluation Model for Matrix Calculation on GPU

Mengjia Yin,Xianbin Xu,Conghuan Ye,Tao Zhang

doi:10.1142/s0218001421540306

Abstract

Establishment of a performance evaluation model is a hotspot of current research. In this paper, the performance bottleneck is analyzed quantitatively, which provided programmers with a guidance to optimize the performance bottleneck. This paper takes a matrix as an example; the matrix is divided into a dense matrix or a sparse matrix. For dense matrix, the performance is first analyzed in a quantitative way, and an evaluation model is developed, which includes the instruction pipeline, shared memory, and global memory. For sparse matrix, this paper aims at the four formats of CSR, ELL, COO, and HYB, through the observation data obtained from the actual operation of large datasets, finds the relationship between the running time, dataset form, and storage model, and establishes their relational model functions. Through practical test and comparison, the error between the execution time of the test dataset that is predicted by the model function and the actual running time is found to be within a stable finite deviation threshold, proving that the model has certain practicability.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Performance Evaluation Model for Matrix Calculation on GPU

Abstract

Talk to us

Similar Papers

More From: International Journal of Pattern Recognition and Artificial Intelligence

Lead the way for us

Journal: International Journal of Pattern Recognition and Artificial Intelligence	Publication Date: Oct 15, 2021
Citations: 1

Similar Papers

GE-SpMM: General-Purpose Sparse Matrix-Matrix Multiplication on GPUs for Graph Neural Networks
Guyue Huang ... Huazhong Yang
-
Guyue Huang, et. al.Guyue Huang ... Huazhong Yang
01 Nov 2020
01 Nov 2020

Full Parameter Time Complexity (FPTC): A Method to Evaluate the Running Time of Machine Learning Classifiers for Land Use/Land Cover Classification
Xiaorou Zheng ... Yingfei Xiong
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 14
Xiaorou Zheng, et. al.Xiaorou Zheng ... Yingfei Xiong
01 Jan 2020
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 14

Analyzing throughput and utilization on trestles
Richard L Moore ... Adam Jundt
-
Richard L Moore, et. al.Richard L Moore ... Adam Jundt
16 Jul 2012
16 Jul 2012

Scheduling Jobs on Parallel Systems Using a Relaxed Backfill Strategy
William A Ward ... John E West
-
William A Ward, et. al.William A Ward ... John E West
01 Jan 2002
01 Jan 2002

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Performance Evaluation Model for Matrix Calculation on GPU

Abstract

Talk to us

Similar Papers

More From: International Journal of Pattern Recognition and Artificial Intelligence