Sparse Cholesky Factorization Algorithms Research Articles

In recent years, there has been widespread adoption of machine learning-based approaches to automate the solving of partial differential equations (PDEs). Among these approaches, Gaussian processes (GPs) and kernel methods have garnered considerable interest due to their flexibility, robust theoretical guarantees, and close ties to traditional methods. They can transform the solving of general nonlinear PDEs into solving quadratic optimization problems with nonlinear, PDE-induced constraints. However, the complexity bottleneck lies in computing with dense kernel matrices obtained from pointwise evaluations of the covariance kernel, and its partial derivatives, a result of the PDE constraint and for which fast algorithms are scarce. The primary goal of this paper is to provide a near-linear complexity algorithm for working with such kernel matrices. We present a sparse Cholesky factorization algorithm for these matrices based on the near-sparsity of the Cholesky factor under a novel ordering of pointwise and derivative measurements. The near-sparsity is rigorously justified by directly connecting the factor to GP regression and exponential decay of basis functions in numerical homogenization. We then employ the Vecchia approximation of GPs, which is optimal in the Kullback-Leibler divergence, to compute the approximate factor. This enables us to compute ϵ \epsilon -approximate inverse Cholesky factors of the kernel matrices with complexity O ( N log d ⁡ ( N / ϵ ) ) O(N\log ^d(N/\epsilon )) in space and O ( N log 2 d ⁡ ( N / ϵ ) ) O(N\log ^{2d}(N/\epsilon )) in time. We integrate sparse Cholesky factorizations into optimization algorithms to obtain fast solvers of the nonlinear PDE. We numerically illustrate our algorithm’s near-linear space/time complexity for a broad class of nonlinear PDEs such as the nonlinear elliptic, Burgers, and Monge-Ampère equations. In summary, we provide a fast, scalable, and accurate method for solving general PDEs with GPs and kernel methods.

SUMMARYSparse Cholesky factorization is the most computationally intensive component in solving large sparse linear systems and is the core algorithm of numerous scientific computing applications. A large number of sparse Cholesky factorization algorithms have previously emerged, exploiting architectural features for various computing platforms. The recent use of graphics processing units (GPUs) to accelerate structured parallel applications shows the potential to achieve significant acceleration relative to desktop performance. However, sparse Cholesky factorization has not been explored sufficiently because of the complexity involved in its efficient implementation and the concerns of low GPU utilization.In this paper, we present a new approach for sparse Cholesky factorization on GPUs. We present the organization of the sparse matrix supernode data structure for GPU and propose a queue‐based approach for the generation and scheduling of GPU tasks with dense linear algebraic operations. We also design a subtree‐based parallel method for multi‐GPU system. These approaches increase GPU utilization, thus resulting in substantial computational time reduction.Comparisons are made with the existing parallel solvers by using problems arising from practical applications. The experiment results show that the proposed approaches can substantially improve sparse Cholesky factorization performance on GPUs. Relative to a highly optimized parallel algorithm on a 12‐core node, we were able to obtain speedups in the range 1.59× to 2.31× by using one GPU and 1.80× to 3.21× by using two GPUs. Relative to a state‐of‐the‐art solver based on supernodal method for CPU‐GPU heterogeneous platform, we were able to obtain speedups in the range 1.52× to 2.30× by using one GPU and 2.15× to 2.76× by using two GPUs. Concurrency and Computation: Practice and Experience, 2013. Copyright © 2013 John Wiley & Sons, Ltd.

Sparse Cholesky Factorization Algorithms Research Articles

Related Topics

Articles published on Sparse Cholesky Factorization Algorithms

Sparse Cholesky factorization for solving nonlinear PDEs via Gaussian processes

Supernodal sparse Cholesky factorization on graphics processing units

Scheduling loops with partial loop-carried dependencies

Dynamic Data Distribution and Processor Repartitioning for Irregularly Structured Computations

Highly scalable parallel algorithms for sparse matrix factorization

Block Sparse Cholesky Algorithms on Advanced Uniprocessor Computers

A Supernodal Cholesky Factorization Algorithm for Shared-Memory Multiprocessors

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Sparse Cholesky Factorization Algorithms Research Articles

Related Topics

Articles published on Sparse Cholesky Factorization Algorithms

Sparse Cholesky factorization for solving nonlinear PDEs via Gaussian processes

Supernodal sparse Cholesky factorization on graphics processing units

Scheduling loops with partial loop-carried dependencies

Dynamic Data Distribution and Processor Repartitioning for Irregularly Structured Computations

Highly scalable parallel algorithms for sparse matrix factorization

Block Sparse Cholesky Algorithms on Advanced Uniprocessor Computers

A Supernodal Cholesky Factorization Algorithm for Shared-Memory Multiprocessors