Efficient Sparse Cholesky Factorization on a Massively Parallel SIMD Computer

Fredrik Manne,Hjálmtýr Hafsteinsson

doi:10.1137/0916054

Abstract

We investigate the effect of load balancing when performing Cholesky factorization on a massively parallel SIMD computer. In particular we describe a supernodal algorithm for performing sparse Cholesky factorization. The way the matrix is mapped onto the processors has significant effect on its efficiency. We show that this assignment problem can be modeled as a graph coloring problem in a weighted graph. By a simple greedy algorithm, we obtain substantial speedup compared with previously suggested data mapping schemes. Experimental runs have been made on a 16K processor MasPar MP-2 parallel computer using symmetric test matrices with irregular sparsity structure. On these problems our implementation achieves performance rates of well above 200 Mflops in double precision arithmetic.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Efficient Sparse Cholesky Factorization on a Massively Parallel SIMD Computer

Abstract

Talk to us

Similar Papers

More From: SIAM journal on scientific computing : a publication of the Society for Industrial and Applied Mathematics

Lead the way for us

Journal: SIAM journal on scientific computing : a publication of the Society for Industrial and Applied Mathematics	Publication Date: Jul 1, 1995
Citations: 4

Similar Papers

Highly scalable parallel algorithms for sparse matrix factorization
A. Gupta ... G. Karypis
IEEE Transactions on Parallel and Distributed Systems | VOL. 8
A. Gupta, et. al.A. Gupta ... G. Karypis
01 May 1997
IEEE Transactions on Parallel and Distributed Systems | VOL. 8

A Guide for Achieving High Performance with Very Small Matrices on GPU: A Case Study of Batched LU and Cholesky Factorizations
Azzam Haidar ... Ahmad Abdelfattah
IEEE Transactions on Parallel and Distributed Systems | VOL. 29
Azzam Haidar, et. al.Azzam Haidar ... Ahmad Abdelfattah
03 Jan 2018
IEEE Transactions on Parallel and Distributed Systems | VOL. 29

A Scalable High Performant Cholesky Factorization for Multicore with GPU Accelerators
Hatem Ltaief ... Rajib Nath
-
Hatem Ltaief, et. al.Hatem Ltaief ... Rajib Nath
01 Jan 2010
01 Jan 2010

Computer Solution of Large Linear Systems

-

01 Jan 1998
01 Jan 1998

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient Sparse Cholesky Factorization on a Massively Parallel SIMD Computer

Abstract

Talk to us

Similar Papers

More From: SIAM journal on scientific computing : a publication of the Society for Industrial and Applied Mathematics