GPU-based multifrontal optimizing method in sparse Cholesky factorization

Ran Zheng,Han Jiang,Song Wu,Yong Chen,Hai Jin,Wei Wang

doi:10.1109/asap.2015.7245714

Abstract

In many scientific computing applications, sparse Cholesky factorization is used to solve large sparse linear equations in distributed environment. GPU computing is a new way to solve the problem. However, sparse Cholesky factorization on GPU is hardly to achieve excellent performance due to the structure irregularity of matrix and the low GPU resource utilization. A hybrid CPU-GPU implementation of sparse Cholesky factorization is proposed based on multifrontal method. A large sparse coefficient matrix is decomposed into a series of small dense matrices (frontal matrices) in the method, and then multiple GEMM (General Matrix-matrix Multiplication) operations are computed. GEMMs are the main operations in sparse Cholesky factorization, but they are hardly to perform better in parallel on GPU. In order to improve the performance, the scheme of multiple task queues is adopted when performing multiple GEMMs parallelized with multifrontal method; all GEMM tasks are scheduled dynamically on GPU and CPU based on computation scales for load balance and computing-time reduction. Experimental results show that the approach can outperform the implementations of BLAS and cuBLAS, achieving up to 3.15× and 1.98× speedup, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

GPU-based multifrontal optimizing method in sparse Cholesky factorization

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A Hybrid CPU-GPU Multifrontal Optimizing Method in Sparse Cholesky Factorization
Yong Chen ... Hai Jin
Journal of Signal Processing Systems | VOL. 90
Yong Chen, et. al.Yong Chen ... Hai Jin
24 Feb 2017
Journal of Signal Processing Systems | VOL. 90

The multifrontal method and paging in sparse Cholesky factorization
Joseph W H Liu
ACM Transactions on Mathematical Software | VOL. 15
Joseph W H LiuJoseph W H Liu
01 Dec 1989
ACM Transactions on Mathematical Software | VOL. 15

Implementation of parallel sparse Cholesky factorization on GPU
Dan Zou ... Yong Dou
-
Dan Zou, et. al.Dan Zou ... Yong Dou
01 Dec 2012
01 Dec 2012

Block sparse Cholesky algorithms on advanced uniprocessor computers
E.G Ng ... B.W Peyton
-
E.G Ng, et. al.E.G Ng ... B.W Peyton
01 Dec 1991
01 Dec 1991

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

GPU-based multifrontal optimizing method in sparse Cholesky factorization

Abstract

Talk to us

Similar Papers