Abstract

Solving triangular systems is the building block of the preconditioned GMRES algorithm. Inexact preconditioning is attractive on accelerators because of its high degree of parallelism. In this paper, we propose and implement an iterative, inexact block triangular solve on multiple GPUs within PETSc’s framework. In addition, by developing a distributed block sparse matrix-vector multiplication procedure and optimizing the vector operations, we form a multi-GPU-enabled preconditioned GMRES with the block Jacobi preconditioner. The implementation employs the GPU-Direct technique to avoid host-device memory copies. Preconditioning steps based on PETSc’s structure and on the cuSPARSE library are also investigated for performance comparison. The experiments show that the developed GMRES with inexact preconditioning on 8 GPUs achieves up to a 4.4x speedup over the CPU-only implementation with exact preconditioning using 8 MPI processes.

Highlights

  • Solving a large sparse linear system of equations is always necessary in scientific applications

  • Khodja et al. [3] implemented the Generalized Minimal Residual (GMRES) algorithm on a GPU cluster using the Message Passing Interface (MPI) and CUDA, focusing on minimizing inter-process communication through compressed storage and hypergraph partitioning techniques

  • Algorithm 5 lists the main steps of the function which we develop and integrate into PETSc. The first step, called the preprocessing phase, estimates the memory requirement, allocates adequate space, and extracts the parallelism available for the subsequent solve phase. The preprocessing step needs to be executed only once on the local GPUs, because the lower (L(Bi)) and upper (U(Bi)) factors remain unchanged during the iterative process of GMRES once they have been constructed by the incomplete LU (ILU) factorization. The second step solves two block sparse triangular systems by calling cusparseDbsrsv2_solve twice, with (L(Bi)) and (U(Bi)) respectively
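The triangular solve in the second step can be illustrated on the CPU with plain block forward substitution. This is a minimal sketch, not PETSc’s or cuSPARSE’s implementation: we assume a unit block lower triangular factor, as produced for L by ILU, whose strictly lower blocks are stored in BSR arrays with row-major layout inside each block; all names here are ours.

```c
#include <assert.h>
#include <stddef.h>

/* Solve L * y = b by block forward substitution, where L is unit block
 * lower triangular: the diagonal blocks are identities (as for the L
 * factor of ILU) and only the strictly lower blocks are stored in the
 * BSR arrays rowptr / colval / blkval. mb is the number of block rows
 * and bs the (square) block size. */
static void bsr_lower_solve(int mb, int bs, const int *rowptr,
                            const int *colval, const double *blkval,
                            const double *b, double *y) {
  for (int i = 0; i < mb; ++i) {
    /* start from the right-hand side of block row i */
    for (int r = 0; r < bs; ++r) y[i * bs + r] = b[i * bs + r];
    /* subtract contributions of the already-computed block unknowns */
    for (int k = rowptr[i]; k < rowptr[i + 1]; ++k) {
      const int j = colval[k];            /* block column, j < i */
      const double *blk = blkval + (size_t)k * bs * bs;
      for (int r = 0; r < bs; ++r)
        for (int c = 0; c < bs; ++c)
          y[i * bs + r] -= blk[r * bs + c] * y[j * bs + c];
    }
  }
}
```

The solve with (U(Bi)) is the mirror image, a block backward substitution that additionally inverts the diagonal blocks; on the GPU both are handled by cusparseDbsrsv2_solve after its analysis phase has extracted the level-scheduling parallelism.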



Introduction

Solving a large sparse linear system of equations is always necessary in scientific applications. Gao et al. [8] proposed an efficient GPU kernel for the sparse matrix-vector multiplication (SpMV) in GMRES and applied the optimized GMRES to solving the two-dimensional Maxwell’s equations. He et al. [9] presented an efficient GPU implementation of GMRES with ILU preconditioners for solving large linear dynamic systems. Popular numerical libraries such as Intel MKL [30], NVIDIA’s cuSPARSE [28], and PETSc [31] support the BCSR format. In this format, a block sparse matrix A with nnzb nonzero blocks is represented by block rows using three arrays: rowptr, colval, and blkval. We assume 0-based indexing, as in the C programming language.
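A minimal CPU sketch of the BCSR layout and the corresponding SpMV may make the three arrays concrete. The array names follow the text; the row-major layout inside each block is our assumption for illustration (cuSPARSE, for instance, lets the caller choose the intra-block direction).

```c
#include <assert.h>
#include <stddef.h>

/* y = A * x for a BCSR matrix with mb block rows and square bs x bs
 * blocks. rowptr[i]..rowptr[i+1]-1 index the nonzero blocks of block
 * row i, colval[k] gives the block column of the k-th stored block,
 * and blkval stores each block contiguously (row-major within a
 * block, assumed here for illustration). */
static void bsr_spmv(int mb, int bs, const int *rowptr, const int *colval,
                     const double *blkval, const double *x, double *y) {
  for (int i = 0; i < mb; ++i) {
    for (int r = 0; r < bs; ++r) y[i * bs + r] = 0.0;
    for (int k = rowptr[i]; k < rowptr[i + 1]; ++k) {
      const int j = colval[k];                 /* block column index */
      const double *blk = blkval + (size_t)k * bs * bs;
      for (int r = 0; r < bs; ++r)             /* dense bs x bs GEMV */
        for (int c = 0; c < bs; ++c)
          y[i * bs + r] += blk[r * bs + c] * x[j * bs + c];
    }
  }
}
```

For example, a 4x4 matrix with 2x2 blocks and nonzero blocks at block positions (0,0), (0,1), and (1,1) is stored as rowptr = {0, 2, 3}, colval = {0, 1, 1}, and twelve entries in blkval.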

