Accelerating Volkov's Hybrid Implementation of Cholesky Factorization on a Fermi GPU

Shih-Chieh Wei,Bormin Huang

doi:10.1109/icpads.2012.147

Abstract

In linear algebra, Cholesky factorization is useful in solving a system of equations with a symmetric positive definite coefficient matrix. Cholesky factorization is roughly twice as fast relative to LU factorization which applies to general matrices. In recent years, with advances in technology, a Fermi GPU card can accommodate hundreds of cores compared to the small number of 8 or 16 cores on CPU. Therefore a trend is seen to use the graphics card as a general purpose graphics processing unit (GPGPU) for parallel computation. In this work, Volkov's hybrid implementation of Cholesky factorization is evaluated on the new Fermi GPU with others and then some improvement strategies were proposed. After experiments, compared to the CPU version using Intel Math Kernel Library (MKL), our proposed GPU improvement strategy can achieve a speedup of 3.85x on Cholesky factorization of a square matrix of dimension 10,000.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Accelerating Volkov's Hybrid Implementation of Cholesky Factorization on a Fermi GPU

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Regularized symmetric positive definite matrix factorizations for linear systems arising from RBF interpolation and differentiation
Scott A Sarra
Engineering Analysis with Boundary Elements | VOL. 44
Scott A SarraScott A Sarra
20 May 2014
Engineering Analysis with Boundary Elements | VOL. 44

Set-to-Set Distance Metric Learning on SPD Manifolds
Zhi Gao ... Yunde Jia
-
Zhi Gao, et. al.Zhi Gao ... Yunde Jia
01 Jan 2018
01 Jan 2018

Schur-type methods based on subspace considerations
J Gotze ... H Park
-
J Gotze, et. al.J Gotze ... H Park
09 Jun 1997
09 Jun 1997

Riemannian Geometry of Symmetric Positive Definite Matrices via Cholesky Decomposition
Zhenhua Lin
SIAM Journal on Matrix Analysis and Applications | VOL. 40
Zhenhua LinZhenhua Lin
01 Jan 2019
SIAM Journal on Matrix Analysis and Applications | VOL. 40

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Accelerating Volkov's Hybrid Implementation of Cholesky Factorization on a Fermi GPU

Abstract

Talk to us

Similar Papers