Performance modeling of the sparse matrix–vector product via convolutional neural networks

Maria Barreda, M Asunción Castaño, Manuel F Dolz, Enrique S Quintana-Ortí, Pedro Alonso-Jordá

doi:10.1007/s11227-020-03186-1


Open Access

https://doi.org/10.1007/s11227-020-03186-1


Abstract

Modeling the execution time of the sparse matrix–vector multiplication (SpMV) on a current CPU architecture is especially complex due to (i) irregular memory accesses; (ii) indirect memory referencing; and (iii) low arithmetic intensity. While analytical models may yield accurate estimates of the total number of cache hits and misses, they often fail to accurately predict the total execution time. In this paper, we depart from the analytical approach and instead leverage convolutional neural networks (CNNs) to provide an effective estimate of the performance of the SpMV operation. For this purpose, we present a high-level abstraction of the sparsity pattern of the problem matrix and propose a blockwise strategy that feeds the CNN models with blocks of nonzero elements. The experimental evaluation on a representative subset of the matrices from the SuiteSparse Matrix Collection demonstrates the robustness of the CNN models for predicting the SpMV performance on an Intel Haswell core. Furthermore, we show how to generalize the network models to other target architectures in order to estimate the performance of SpMV on an ARM A57 core.
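
To make the three difficulties named in the abstract concrete, the following is a minimal sketch of SpMV, assuming the matrix is stored in the common compressed sparse row (CSR) layout. The abstract does not state which storage format the authors use, so CSR, the function name `spmv_csr`, and the small 3×3 example are illustrative assumptions rather than the paper's implementation.

```python
from typing import List


def spmv_csr(row_ptr: List[int], col_idx: List[int],
             values: List[float], x: List[float]) -> List[float]:
    """Compute y = A @ x for a matrix A stored in CSR form (assumed layout)."""
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for i in range(n_rows):
        acc = 0.0
        # The nonzeros of row i occupy positions row_ptr[i] .. row_ptr[i+1]-1.
        for k in range(row_ptr[i], row_ptr[i + 1]):
            # x is read through col_idx[k]: an indirect reference whose
            # target depends on the sparsity pattern, so accesses to x are
            # irregular. Each nonzero contributes only one multiply and one
            # add, which is why the arithmetic intensity is low.
            acc += values[k] * x[col_idx[k]]
        y[i] = acc
    return y


# Hypothetical 3x3 example matrix:
# [[4, 0, 1],
#  [0, 2, 0],
#  [3, 0, 5]]
row_ptr = [0, 2, 3, 5]
col_idx = [0, 2, 1, 0, 2]
values = [4.0, 1.0, 2.0, 3.0, 5.0]
x = [1.0, 2.0, 3.0]

y = spmv_csr(row_ptr, col_idx, values, x)
print(y)  # [7.0, 4.0, 18.0]
```

In this sketch the order in which entries of `x` are touched is determined entirely by `col_idx`, which is one way to see why the sparsity pattern is a natural input for a performance model.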


Journal: The Journal of Supercomputing
Publication Date: Feb 4, 2020
Citations: 7
License type: other-oa


