Tridigpu: A GPU Library for Block Tridiagonal and Banded Linear Equation Systems

Christoph Klein, Robert Strzodka

doi:10.1145/3580373

Abstract

In this article, we present a CUDA library with a C API for solving block cyclic tridiagonal and banded systems on a single GPU. The library can process block tridiagonal systems with block sizes from 1 × 1 (scalar) to 4 × 4 and banded systems with up to four sub- and superdiagonals. For the compute-intensive block size cases and for cases with many right-hand sides, we write out an explicit factorization to memory; for the scalar case, however, the fastest approach is to output only the coarse system and recompute the factorization. Prominent features of the library are (scaled) partial pivoting for improved numerical stability; high-performance kernels that fully utilize GPU memory bandwidth; and support for multiple sparse or dense right-hand side and solution vectors. The additional memory consumption is only 5% of the original tridiagonal system, which enables the solution of systems up to GPU memory size. The library outperforms the state-of-the-art scalar tridiagonal solver of cuSPARSE by a factor of 5 for large problem sizes of 2^25 unknowns on a GeForce RTX 2080 Ti.
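To make the role of partial pivoting in a scalar tridiagonal solve concrete, here is a minimal sequential sketch in plain Python. It is not the library's parallel GPU algorithm and does not use its C API; the function name `solve_tridiagonal_pivoting` and its argument layout are assumptions made for illustration. The sketch shows that a row swap introduces fill-in on a second superdiagonal, which is the extra storage that pivoting requires.

```python
def solve_tridiagonal_pivoting(lower, diag, upper, rhs):
    """Solve a scalar tridiagonal system with partial pivoting (CPU sketch).

    lower: n-1 subdiagonal entries, diag: n diagonal entries,
    upper: n-1 superdiagonal entries, rhs: n right-hand side entries.
    Hypothetical helper for illustration; not part of any library API.
    """
    n = len(diag)
    if n == 0:
        return []
    # Work on copies so the caller's data stays untouched.
    a, b, c, d = list(lower), list(diag), list(upper), list(rhs)
    # Second superdiagonal: only becomes nonzero when rows are swapped.
    e = [0.0] * max(n - 2, 0)

    for i in range(n - 1):
        if abs(b[i]) >= abs(a[i]):
            # Pivot is already the larger entry: plain elimination.
            if b[i] == 0:
                raise ValueError("matrix is singular")
            m = a[i] / b[i]
            b[i + 1] -= m * c[i]
            d[i + 1] -= m * d[i]
        else:
            # Swap rows i and i+1, then eliminate below the new pivot.
            m = b[i] / a[i]
            b[i], c[i], b[i + 1] = a[i], b[i + 1], c[i] - m * b[i + 1]
            if i < n - 2:
                e[i] = c[i + 1]          # fill-in caused by the swap
                c[i + 1] = -m * c[i + 1]
            d[i], d[i + 1] = d[i + 1], d[i] - m * d[i + 1]

    if b[n - 1] == 0:
        raise ValueError("matrix is singular")

    # Back substitution on the upper triangular factor (two superdiagonals).
    x = [0.0] * n
    x[n - 1] = d[n - 1] / b[n - 1]
    if n >= 2:
        x[n - 2] = (d[n - 2] - c[n - 2] * x[n - 1]) / b[n - 2]
    for i in range(n - 3, -1, -1):
        x[i] = (d[i] - c[i] * x[i + 1] - e[i] * x[i + 2]) / b[i]
    return x
```

Calling the function on a system with zeros on the diagonal, for example `solve_tridiagonal_pivoting([1, 1], [0, 0, 1], [1, 1], [2, 4, 5])`, succeeds only because of the row swap; elimination without pivoting would divide by zero in the first step.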


More From: ACM Transactions on Parallel Computing

