Abstract

Sparse matrix-vector multiplication (SpMV) is a fundamental kernel in computational science, and the performance of a large number of applications depends on its efficiency. SpMV is a bandwidth-limited operation that is challenging to optimize when the matrix has an irregular structure. Over the last few years, a large body of research has been devoted to implementing SpMV on throughput-oriented manycore processors. Several sparse matrix formats have been proposed, with different strengths and weaknesses, along with alternative optimization strategies such as row reordering. This paper proposes an architecture-aware technique for improving the performance of SpMV on Graphics Processing Units (GPUs). The optimization is based on a novel heuristic that reduces cache memory accesses within hardware-level thread groups (warps). The technique is designed and implemented using a variation of the sliced ELL sparse format, but the underlying idea is structure-independent and can easily be adapted to other sparse representations. We tested the proposed architecture-aware optimization on a large set of benchmarks from heterogeneous application domains. The results show consistent improvements for double-precision calculations: an average 9% increase in performance, with speedups of up to 2.24× over the baseline.
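
For readers unfamiliar with the sliced ELL layout that the paper builds on, the sketch below shows a baseline double-precision SpMV kernel over that format in CUDA. It illustrates only the standard layout (slice height equal to the warp size, column-major storage within each slice, rows padded to the slice's longest row), not the paper's variation or its cache-access heuristic; all identifiers and the padding convention are assumptions for illustration.

```cuda
// Minimal sliced-ELL SpMV sketch: one thread per row, one warp per slice.
// Padded entries are assumed to hold value 0.0 and column index 0, so they
// contribute nothing to the dot product while keeping x[] accesses in bounds.
#include <cuda_runtime.h>

#define SLICE_HEIGHT 32  // assume slice height == warp size

__global__ void spmv_sell(int n_rows,
                          const int    *slice_ptr,  // offset of each slice in cols/values
                          const int    *cols,       // column indices, column-major per slice
                          const double *values,     // nonzeros plus padding
                          const double *x,          // input vector
                          double       *y)          // output vector
{
    int row = blockIdx.x * blockDim.x + threadIdx.x;
    if (row >= n_rows) return;

    int slice = row / SLICE_HEIGHT;  // slice this row belongs to
    int lane  = row % SLICE_HEIGHT;  // row's position inside the slice

    int begin = slice_ptr[slice];
    int end   = slice_ptr[slice + 1];
    int width = (end - begin) / SLICE_HEIGHT;  // padded width of this slice

    double sum = 0.0;
    for (int j = 0; j < width; ++j) {
        // Column-major storage within the slice means the threads of a warp
        // read consecutive memory locations here, so the loads coalesce.
        int idx = begin + j * SLICE_HEIGHT + lane;
        sum += values[idx] * x[cols[idx]];
    }
    y[row] = sum;
}
```

Because padding cost is paid per slice rather than per matrix, sliced ELL wastes far less memory than plain ELL on matrices with a skewed row-length distribution, which is what makes it a common starting point for GPU SpMV optimizations like the one described above.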
