The Sliced COO Format for Sparse Matrix-Vector Multiplication on CUDA-enabled GPUs

Hoang-Vu Dang,Bertil Schmidt

doi:10.1016/j.procs.2012.04.007

Hoang-Vu Dang, Bertil Schmidt

Open Access

https://doi.org/10.1016/j.procs.2012.04.007

Copy DOI

Export

Save

Cite

Journal: Procedia Computer Science	Publication Date: Jan 1, 2012
Citations: 18	License type: cc-by-nc-nd

Affiliation: Johannes Gutenberg University Mainz

Abstract
Full-Text
Similar Papers

Abstract

Listen

Existing formats for Sparse Matrix-Vector Multiplication (SpMV) on the GPU are outperforming their corresponding implementations on multi-core CPUs. In this paper, we present a new format called Sliced COO (SCOO) and an effcient CUDA implementation to perform SpMV on the GPU. While previous work shows experiments on small to medium-sized sparse matrices, we perform evaluations on large sparse matrices. We compared SCOO performance to existing formats of the NVIDIA Cusp library. Our resutls on a Fermi GPU show that SCOO outperforms the COO and CSR format for all tested matrices and the HYB format for all tested unstructured matrices. Furthermore, comparison to a Sandy-Bridge CPU shows that SCOO on a Fermi GPU outperforms the multi-threaded CSR implementation of the Intel MKL Library on an i7-2700K by a factor between 5.5 and 18.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

The Sliced COO Format for Sparse Matrix-Vector Multiplication on CUDA-enabled GPUs

Abstract

Published Version

Talk to us

Similar Papers

More From: Procedia Computer Science

Lead the way for us

Similar Papers

CUDA-enabled Sparse Matrix–Vector Multiplication on GPUs using atomic operations
Hoang-Vu Dang ... Bertil Schmidt
Parallel Computing | VOL. 39
Hoang-Vu Dang, et. al.Hoang-Vu Dang ... Bertil Schmidt
07 Oct 2013
Parallel Computing | VOL. 39

Large-Scale Sparse Singular Value Computations
Michael W Berry
The International Journal of Supercomputing Applications | VOL. 6
Michael W BerryMichael W Berry
01 Apr 1992
The International Journal of Supercomputing Applications | VOL. 6

COMPUTING EXTREMAL SINGULAR TRIPLETS OF SPARSE MATRICES ON A SHARED-MEMORY MULTIPROCESSOR
M.W Berry ... B.N Parlett
International Journal of High Speed Computing | VOL. 06
M.W Berry, et. al.M.W Berry ... B.N Parlett
01 Jun 1994
International Journal of High Speed Computing | VOL. 06

Sparse matrix partitioning for optimizing SpMV on CPU-GPU heterogeneous platforms
Akrem Benatia ... Yizhuo Wang
The International Journal of High Performance Computing Applications | VOL. 34
Akrem Benatia, et. al.Akrem Benatia ... Yizhuo Wang
14 Nov 2019
The International Journal of High Performance Computing Applications | VOL. 34

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

The Sliced COO Format for Sparse Matrix-Vector Multiplication on CUDA-enabled GPUs

Abstract

Published Version

Talk to us

Similar Papers

More From: Procedia Computer Science