PELLR: A Permutated ELLPACK-R Format for SpMV on GPUs

Zhiqi Wang,Tongxiang Gu

doi:10.4236/jcc.2020.84004

Abstract

The sparse matrix vector multiplication (SpMV) is inevitable in almost all kinds of scientific computation, such as iterative methods for solving linear systems and eigenvalue problems. With the emergence and development of Graphics Processing Units (GPUs), high efficient formats for SpMV should be constructed. The performance of SpMV is mainly determinted by the storage format for sparse matrix. Based on the idea of JAD format, this paper improved the ELLPACK-R format, reduced the waiting time between different threads in a warp, and the speed up achieved about 1.5 in our experimental results. Compared with other formats, such as CSR, ELL, BiELL and so on, our format performance of SpMV is optimal over 70 percent of the test matrix. We proposed a method based on parameters to analyze the performance impact on different formats. In addition, a formula was constructed to count the computation and the number of iterations.

Highlights

The Sparse matrix vector multiplication (SpMV) is a key operation in for a variety of computation science, such as in many iterative methods for solving linear systems ( Ax = b ), image processing, simulation and so on
There are many storage formats related to sparse matrix, such as compressed sparse row (CSR), ELL, hybrid format (HYB), BiELL and so on
The calculation of sparse matrix vector multiplication (SpMV) based on COO format is not suitable for Graphics Processing Units (GPUs) structure when the matrix is stored in disorder

Summary

Introduction

GPU including many Stream Processors, and many threads can simultaneously calculate multiple groups of data, with high computational power and very high memory bandwidth. In order to improve the computational efficiency, it is important to make changes to find a suitable matrix storage format and calculation method. There are many storage formats related to sparse matrix, such as CSR, ELL, HYB, BiELL and so on. In [5] we can see ELL performance for the structured matrices because it has continuous access to memory. The ELLPACK-R format presented in [6] is optimized to reduce the waiting time between different threads.

Basic Formats to Sparse Matrices

COO Format

CSR Format

ELL-Like Formats

Our New Format

Numerical Result

Findings

Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Computer and Communications	Publication Date: Jan 1, 2020
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

PELLR: A Permutated ELLPACK-R Format for SpMV on GPUs

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Computer and Communications

Lead the way for us

Similar Papers

A new sparse matrix vector multiplication graphics processing unit algorithm designed for finite element problems
J Wong ... E Kuhl
International Journal for Numerical Methods in Engineering | VOL. 102
J Wong, et. al.J Wong ... E Kuhl
09 Jan 2015
International Journal for Numerical Methods in Engineering | VOL. 102

Yet another Hybrid Strategy for Auto-tuning SpMV on GPUs
Zhaohui Wang ... Aimin Zhang
International Journal of Software Engineering and Its Applications | VOL. 9
Zhaohui Wang, et. al.Zhaohui Wang ... Aimin Zhang
31 May 2015
International Journal of Software Engineering and Its Applications | VOL. 9

A model-driven partitioning and auto-tuning integrated framework for sparse matrix-vector multiplication on GPUs
Ping Guo ... He Huang
-
Ping Guo, et. al.Ping Guo ... He Huang
18 Jul 2011
18 Jul 2011

Sparse Matrix Sparse Vector Multiplication - A Novel Approach
Monika Shah
-
Monika ShahMonika Shah
01 Sep 2015
01 Sep 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

PELLR: A Permutated ELLPACK-R Format for SpMV on GPUs

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Computer and Communications