Lower-upper Factorization Research Articles

Jacobian clustering is proposed to reduce the computational cost associated with matrix operations encountered in the Newton iteration in fully coupled, fully implicit schemes for unsteady reactive flow simulations with detailed chemistry. The iterative solver is based on the Lower-Upper Symmetric Gauss-Seidel (LUSGS) algorithm and sparse matrix technique. The evaluation and sparse Lower-Upper (LU) factorization of the diagonal block of the system Jacobian are performed within clusters rather than individual cells for Computational Fluid Dynamics (CFD) simulations. Cells that are close in the state space are clustered to provide the averaged states for calculating the Jacobians of chemical source terms and transport fluxes. The cells retrieve the factorized sparse matrices from the belonging clusters to perform the necessary iterations. For the purpose of clustering, the spatial dependency of transport Jacobian in the diagonal block is eliminated. To further reduce the computational cost, the sparsity of chemical Jacobian is augmented by removing the insignificant matrix elements. The method is tested in various one-dimensional hydrocarbon flames with both the second order Crank-Nicolson scheme and a third order implicit Runge-Kutta scheme. Various chemical mechanisms with 9 to 111 species are used to test the performance of the iterative solver. Fast convergence of Newton iteration is achieved, and the formal order of accuracy is demonstrated with Jacobian clustering. The overall costs of evaluation and factorization of the block diagonal Jacobian are negligible compared to the cost of calculating transport fluxes and chemical source terms. The averaged costs of Jacobian evaluation, LU factorization and Newton iteration, all increase only linearly with the number of chemical species. The fully coupled, fully implicit Crank-Nicolson scheme with Jacobian clustering shows 4 to 42 times speedup in computational time compared to the decoupled implicit scheme with Strang operator splitting. Jacobian clustering is promising to increase the computational efficiency of high order fully coupled, fully implicit schemes for unsteady reactive flow simulations with detailed chemistry.

Lower upper (LU) factorization for sparse matrices is the most important computing step for circuit simulation problems. However, parallelizing LU factorization on the graphic processing units (GPUs) turns out to be a difficult problem due to intrinsic data dependence and irregular memory access, which diminish GPU computing power. In this paper, we propose a new sparse LU solver on GPUs for circuit simulation and more general scientific computing. The new method, which is called GPU accelerated LU factorization (GLU) solver (for GPU LU), is based on a hybrid right-looking LU factorization algorithm for sparse matrices. We show that more concurrency can be exploited in the right-looking method than the left-looking method, which is more popular for circuit analysis, on GPU platforms. At the same time, the GLU also preserves the benefit of column-based left-looking LU method, such as symbolic analysis and column-level concurrency. We show that the resulting new parallel GPU LU solver allows the parallelization of all three loops in the LU factorization on GPUs. While in contrast, the existing GPU-based left-looking LU factorization approach can only allow parallelization of two loops. Experimental results show that the proposed GLU solver can deliver $5.71\times $ and $1.46\times $ speedup over the single-threaded and the 16-threaded PARDISO solvers, respectively, $19.56\times $ speedup over the KLU solver, $47.13\times $ over the UMFPACK solver, and $1.47\times $ speedup over a recently proposed GPU-based left-looking LU solver on the set of typical circuit matrices from the University of Florida (UFL) sparse matrix collection. Furthermore, we also compare the proposed GLU solver on a set of general matrices from the UFL, GLU achieves $6.38\times $ and $1.12\times $ speedup over the single-threaded and the 16-threaded PARDISO solvers, respectively, $39.39\times $ speedup over the KLU solver, $24.04\times $ over the UMFPACK solver, and $2.35\times $ speedup over the same GPU-based left-looking LU solver. In addition, comparison on self-generated $RLC$ mesh networks shows a similar trend, which further validates the advantage of the proposed method over the existing sparse LU solvers.

Lower-upper Factorization Research Articles

Related Topics

Articles published on Lower-upper Factorization

NUMA-aware parallel sparse LU factorization for SPICE-based circuit simulators on ARM multi-core processors

Unique Symbolic Factorization for Fast Contingency Analysis Using Full Newton–Raphson Method

Orthogonal Time Frequency Space Detection via Low-Complexity Expectation Propagation

A fully coupled, fully implicit simulation method for unsteady flames using Jacobian approximation and clustering

Approximation of P-, S1-, and S2-wave reflection coefficients for orthorhombic media

A novel and efficient engine for P-/S-wave-mode vector decomposition for vertical transverse isotropic elastic reverse time migration

Accurate 3D frequency-domain seismic wave modeling with the wavelength-adaptive 27-point finite-difference stencil: A tool for full-waveform inversion

Numerical Simulation of Multiphase Multicomponent Flow in Porous Media: Efficiency Analysis of Newton-Based Method

Performance Analysis of the χMD Matrix Solver Package for MODFLOW-USG.

A spectral element method to compute approximations of the anisotropic diffusion operator with bidimensional tensor coefficient

Explicit nonlinear predictive control algorithms for Laguerre filter and sparse least square support vector machine-based Wiener model

GLU3.0: Fast GPU-based Parallel Sparse LU Factorization for Circuit Simulation

Online Thevenin Equivalent Parameter Identification Method of Large Power Grids Using LU Factorization

Three-Dimensional Wide-Band Electromagnetic Forward Modelling Using Potential Technique

High-Resolution Three-Dimensional Displacement Retrieval of Mining Areas From a Single SAR Amplitude Pair Using the SPIKE Algorithm

Stochastic LU factorizations, Darboux transformations and urn models

An Efficient Sixth-Order Newton-Type Method for Solving Nonlinear Systems

An optimized high payload audio watermarking algorithm based on LU-factorization

LU factorization on heterogeneous systems: an energy-efficient approach towards high performance

GPU-Accelerated Parallel Sparse LU Factorization Method for Fast Circuit Analysis

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Lower-upper Factorization Research Articles

Related Topics

Articles published on Lower-upper Factorization

NUMA-aware parallel sparse LU factorization for SPICE-based circuit simulators on ARM multi-core processors

Unique Symbolic Factorization for Fast Contingency Analysis Using Full Newton–Raphson Method

Orthogonal Time Frequency Space Detection via Low-Complexity Expectation Propagation

A fully coupled, fully implicit simulation method for unsteady flames using Jacobian approximation and clustering

Approximation of P-, S1-, and S2-wave reflection coefficients for orthorhombic media

A novel and efficient engine for P-/S-wave-mode vector decomposition for vertical transverse isotropic elastic reverse time migration

Accurate 3D frequency-domain seismic wave modeling with the wavelength-adaptive 27-point finite-difference stencil: A tool for full-waveform inversion

Numerical Simulation of Multiphase Multicomponent Flow in Porous Media: Efficiency Analysis of Newton-Based Method

Performance Analysis of the χMD Matrix Solver Package for MODFLOW-USG.

A spectral element method to compute approximations of the anisotropic diffusion operator with bidimensional tensor coefficient

Explicit nonlinear predictive control algorithms for Laguerre filter and sparse least square support vector machine-based Wiener model

GLU3.0: Fast GPU-based Parallel Sparse LU Factorization for Circuit Simulation

Online Thevenin Equivalent Parameter Identification Method of Large Power Grids Using LU Factorization

Three-Dimensional Wide-Band Electromagnetic Forward Modelling Using Potential Technique

High-Resolution Three-Dimensional Displacement Retrieval of Mining Areas From a Single SAR Amplitude Pair Using the SPIKE Algorithm

Stochastic LU factorizations, Darboux transformations and urn models

An Efficient Sixth-Order Newton-Type Method for Solving Nonlinear Systems

An optimized high payload audio watermarking algorithm based on LU-factorization

LU factorization on heterogeneous systems: an energy-efficient approach towards high performance

GPU-Accelerated Parallel Sparse LU Factorization Method for Fast Circuit Analysis