A Factored Sparse Approximate Inverse Preconditioned Conjugate Gradient Solver on Graphics Processing Units

Massimo Bernaschi,Carlo Fantozzi,Mauro Bisson,Carlo Janna

doi:10.1137/15m1027826

Massimo Bernaschi, Carlo Fantozzi + Show 2 more

Open Access

https://doi.org/10.1137/15m1027826

Copy DOI

Journal: SIAM Journal on Scientific Computing	Publication Date: Jan 1, 2016
Citations: 18	License type: other-oa

Abstract

Graphics Processing Units (GPUs) exhibit significantly higher peak performance than conventional CPUs. However, in general only highly parallel algorithms can exploit their potential. In this scenario, the iterative solution to sparse linear systems of equations could be carried out quite efficiently on a GPU as it requires only matrix-by-vector products, dot products, and vector updates. However, to be really effective, any iterative solver needs to be properly preconditioned and this represents a major bottleneck for a successful GPU implementation. Due to its inherent parallelism, the factored sparse approximate inverse (FSAI) preconditioner represents an optimal candidate for the conjugate gradient--like solution of sparse linear systems. However, its GPU implementation requires a nontrivial recasting of multiple computational steps. We present our GPU version of the FSAI preconditioner along with a set of results that show how a noticeable speedup with respect to a highly tuned CPU counterpart is obt...

Full Text