Research on the conjugate gradient algorithm with a modified incomplete Cholesky preconditioner on GPU

Jiaquan Gao, Ronghua Liang, Jun Wang

doi:10.1016/j.jpdc.2013.10.002

Abstract

In this study, we identify the parallelism of the forward/backward substitutions (FBS) for two cases and, on that basis, propose an efficient preconditioned conjugate gradient algorithm with the modified incomplete Cholesky preconditioner on the GPU (GPUMICPCGA). GPUMICPCGA has three distinguishing characteristics: (1) the vector operations are optimized by grouping several of them into single kernels, (2) a new, highly optimized inner-product kernel and a new, highly optimized sparse matrix–vector multiplication kernel are presented, and (3) an efficient parallel implementation of FBS on the GPU (GPUFBS) is proposed for the two cases. Numerical results show that our proposed kernels outperform the corresponding ones in CUBLAS or CUSPARSE, and that GPUFBS is almost 3 times faster than the implementation of FBS using the CUSPARSE library. Furthermore, GPUMICPCGA performs better than its counterpart implemented with the CUBLAS and CUSPARSE libraries.
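To make the structure of the method concrete, the following is a minimal sequential sketch in Python of a conjugate gradient iteration whose preconditioner is applied through forward/backward substitutions with a modified incomplete Cholesky factor. It is not the paper's GPU implementation: it uses dense lists on the CPU, a textbook zero-fill MIC(0) variant in which dropped fill-in is subtracted from the diagonal, and a small 2D Laplacian test matrix. All function names (`mic0`, `pcg`, `laplacian_2d`, and so on) are hypothetical and chosen only for illustration.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def matvec(A, x):
    return [dot(row, x) for row in A]

def laplacian_2d(m):
    """Dense 5-point Laplacian on an m-by-m grid (illustrative test matrix)."""
    n = m * m
    A = [[0.0] * n for _ in range(n)]
    for r in range(m):
        for c in range(m):
            i = r * m + c
            A[i][i] = 4.0
            if c > 0:
                A[i][i - 1] = -1.0
            if c < m - 1:
                A[i][i + 1] = -1.0
            if r > 0:
                A[i][i - m] = -1.0
            if r < m - 1:
                A[i][i + m] = -1.0
    return A

def mic0(A):
    """Zero-fill modified incomplete Cholesky (assumed textbook variant).

    Fill-in outside the sparsity pattern of A is dropped and compensated
    on the diagonal, so L L^T keeps the row sums of A.
    """
    n = len(A)
    pattern = [[A[i][j] != 0.0 for j in range(n)] for i in range(n)]
    W = [row[:] for row in A]  # working copy; lower triangle becomes L
    for k in range(n):
        W[k][k] = math.sqrt(W[k][k])
        for i in range(k + 1, n):
            if pattern[i][k]:
                W[i][k] /= W[k][k]
        for j in range(k + 1, n):
            if not pattern[j][k]:
                continue
            for i in range(j, n):
                if not pattern[i][k]:
                    continue
                update = W[i][k] * W[j][k]
                if pattern[i][j]:
                    W[i][j] -= update
                else:  # dropped fill-in: move it to both diagonals
                    W[i][i] -= update
                    W[j][j] -= update
    return [[W[i][j] if j <= i and pattern[i][j] else 0.0
             for j in range(n)] for i in range(n)]

def forward_substitution(L, b):
    """Solve L y = b for lower-triangular L."""
    y = [0.0] * len(b)
    for i in range(len(b)):
        y[i] = (b[i] - sum(L[i][j] * y[j] for j in range(i))) / L[i][i]
    return y

def backward_substitution(L, y):
    """Solve L^T z = y, reading L^T from the stored lower factor."""
    n = len(y)
    z = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(L[j][i] * z[j] for j in range(i + 1, n))
        z[i] = (y[i] - s) / L[i][i]
    return z

def pcg(A, b, L, tol=1e-10, max_iter=200):
    """Preconditioned CG with M = L L^T applied via the two substitutions."""
    x = [0.0] * len(b)
    r = list(b)
    z = backward_substitution(L, forward_substitution(L, r))
    p = list(z)
    rz = dot(r, z)
    for k in range(max_iter):
        if math.sqrt(dot(r, r)) < tol:
            return x, k
        Ap = matvec(A, p)
        alpha = rz / dot(p, Ap)
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * qi for ri, qi in zip(r, Ap)]
        z = backward_substitution(L, forward_substitution(L, r))
        rz_new = dot(r, z)
        p = [zi + (rz_new / rz) * pi for zi, pi in zip(z, p)]
        rz = rz_new
    return x, max_iter

A = laplacian_2d(3)
b = [1.0] * 9
L = mic0(A)
x, iters = pcg(A, b, L)
residual = max(abs(bi - yi) for bi, yi in zip(b, matvec(A, x)))
```

In this sketch the two substitutions are the only inherently sequential steps: each unknown depends on previously computed ones, which is why exposing parallelism in FBS is the central difficulty the abstract refers to. The vector updates and inner products inside the loop are the kind of operations that the abstract describes grouping into single kernels.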

Journal: Journal of Parallel and Distributed Computing
Publication Date: Oct 30, 2013
Citations: 23

Similar Papers

A Cholesky preconditioned conjugate gradient algorithm on GPU for the 3D parabolic equation
Jiaquan Gao ... Bo Li
International Journal of Computational Science and Engineering | VOL. 11
01 Jan 2015

Modified Incomplete Cholesky Preconditioned Conjugate Gradient Algorithm on GPU for the 3D Parabolic Equation
Jiaquan Gao ... Guixia He
01 Jan 2013

Automatic Tuning of Sparse Matrix-Vector Multiplication for CRS Format on GPUs
Hiroki Yoshizawa ... Daisuke Takahashi
01 Dec 2012

A Barzilai-Borwein conjugate gradient method
Yuhong Dai ... Caixia Kou
Science China Mathematics | VOL. 59
04 Jul 2016
