A CPU–GPU hybrid approach for the unsymmetric multifrontal method

Chenhan D Yu,Dan’L Pierce,Weichung Wang

doi:10.1016/j.parco.2011.09.002

Chenhan D Yu, Dan’L Pierce + Show 1 more

Open Access

https://doi.org/10.1016/j.parco.2011.09.002

Copy DOI

Journal: Parallel Computing	Publication Date: Oct 5, 2011
Citations: 47	License type: mit

Affiliation: National Taiwan University

Abstract

Multifrontal is an efficient direct method for solving large-scale sparse and unsymmetric linear systems. The method transforms a large sparse matrix factorization process into a sequence of factorizations involving smaller dense frontal matrices. Some of these dense operations can be accelerated by using a graphic processing unit (GPU). We analyze the unsymmetric multifrontal method from both an algorithmic and implementational perspective to see how a GPU, in particular the NVIDIA Tesla C2070, can be used to accelerate the computations. Our main accelerating strategies include (i) performing BLAS on both CPU and GPU, (ii) improving the communication efficiency between the CPU and GPU by using page-locked memory, zero-copy memory, and asynchronous memory copy, and (iii) a modified algorithm that reuses the memory between different GPU tasks and sets thresholds to determine whether certain tasks be performed on the GPU. The proposed acceleration strategies are implemented by modifying UMFPACK, which is an unsymmetric multifrontal linear system solver. Numerical results show that the CPU–GPU hybrid approach can accelerate the unsymmetric multifrontal solver, especially for computationally expensive problems.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A CPU–GPU hybrid approach for the unsymmetric multifrontal method

Abstract

Talk to us

Similar Papers

More From: Parallel Computing

Lead the way for us

Similar Papers

Reduction of computing time for seismic applications based on the Helmholtz equation by Graphics Processing Units

-

03 Mar 2015
03 Mar 2015

Implementation of parallel sparse Cholesky factorization on GPU
Dan Zou ... Yong Dou
-
Dan Zou, et. al.Dan Zou ... Yong Dou
01 Dec 2012
01 Dec 2012

Adding GPU Acceleration to an Industrial CPU-Based Simulator, Development Strategy and Results
Neil Gohaud ... Hui Cao
-
Neil Gohaud, et. al.Neil Gohaud ... Hui Cao
19 Oct 2021
19 Oct 2021

Architecting the finite element method pipeline for the GPU
Zhisong Fu ... Ross T Whitaker
Journal of Computational and Applied Mathematics | VOL. 257
Zhisong Fu, et. al.Zhisong Fu ... Ross T Whitaker
06 Sep 2013
Journal of Computational and Applied Mathematics | VOL. 257

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A CPU–GPU hybrid approach for the unsymmetric multifrontal method

Abstract

Talk to us

Similar Papers

More From: Parallel Computing