Dense Matrix Operations Research Articles

This work presents MARE2DEM, a freely available code for 2-D anisotropic inversion of magnetotelluric (MT) data and frequency-domain controlled-source electromagnetic (CSEM) data from onshore and offshore surveys. MARE2DEM parametrizes the inverse model using a grid of arbitrarily shaped polygons, where unstructured triangular or quadrilateral grids are typically used due to their ease of construction. Unstructured grids provide significantly more geometric flexibility and parameter efficiency than the structured rectangular grids commonly used by most other inversion codes. Transmitter and receiver components located on topographic slopes can be tilted parallel to the boundary so that the simulated electromagnetic fields accurately reproduce the real survey geometry. The forward solution is implemented with a goal-oriented adaptive finite-element method that automatically generates and refines unstructured triangular element grids that conform to the inversion parameter grid, ensuring accurate responses as the model conductivity changes. This dual-grid approach is significantly more efficient than the conventional use of a single grid for both the forward and inverse meshes since the more detailed finite-element meshes required for accurate responses do not increase the memory requirements of the inverse problem. Forward solutions are computed in parallel with a highly efficient scaling by partitioning the data into smaller independent modeling tasks consisting of subsets of the input frequencies, transmitters and receivers. Non-linear inversion is carried out with a new Occam inversion approach that requires fewer forward calls. Dense matrix operations are optimized for memory and parallel scalability using the ScaLAPACK parallel library. Free parameters can be bounded using a new non-linear transformation that leaves the transformed parameters nearly the same as the original parameters within the bounds, thereby reducing non-linear smoothing effects. Data balancing normalization weights for the joint inversion of two or more data sets encourages the inversion to fit each data type equally well. A synthetic joint inversion of marine CSEM and MT data illustrates the algorithm's performance and parallel scaling on up to 480 processing cores. CSEM inversion of data from the Middle America Trench offshore Nicaragua demonstrates a real world application. The source code and MATLAB interface tools are freely available at http://mare2dem.ucsd.edu.

A sparse stiff chemistry solver based on dynamic adaptive hybrid integration (AHI-S) is developed and demonstrated for efficient combustion simulations. In a previous study, a dynamic adaptive method for hybrid integration (AHI) was developed to speed up the time integration of chemically reacting flows with detailed chemistry. The AHI method solves the fast subcomponent of chemistry implicitly and the slow subcomponent of chemistry and transport explicitly, and it was shown that AHI is more accurate and efficient than the operator-splitting schemes when there are significant radical sources from the transport term. In the present study, the AHI method is first improved to minimize the number of nontrivial entries in the Jacobian. Sparse matrix techniques are further integrated into AHI to achieve high computational efficiency. The performance of the new AHI-S solver is investigated in constant-pressure auto-ignition systems using different mechanisms that consist of 9–2878 species. It is shown that the computational cost of the AHI-S solver is overall linearly proportional to the mechanism size and is comparable to that of evaluating reaction rates using CHEMKIN-II subroutines. The AHI-S solver achieves speed-up factors ranging from approximately 10, for the 9-species hydrogen mechanism, to approximately 3000, for the 2878-species biodiesel mechanism, compared with the fully implicit VODE solver with Jacobian evaluated through numerical perturbations and factorized with dense matrix operations. It is further found that for mechanisms with less than approximately 100 species, the time saving of AHI-S is primarily attributed to the reduced size of the implicit core of the governing equations, while for mechanisms with more than 100 species, the computational cost of VODE is dominated by the dense LU factorization, such that the time saving of AHI-S is mostly attributed to the sparse LU factorization. The AHI-S solver is then applied to unsteady perfectly stirred reactors involving extinction and re-ignition. Speed-up factors from 50 to 30,000 are achieved compared with the Strang splitting scheme with the chemistry substeps implicitly integrated with VODE, while speed-up factors of 10–100 are achieved compared with the Strang splitting scheme implemented with the sparse stiff LSODES solver. In the end, the performance of AHI-S is investigated in one-dimensional (1-D) unsteady freely propagating laminar premixed flames for a methane/air mixture, for which the time step size in AHI-S is limited by the fastest transport process. A speed-up factor of approximately 200 is achieved compared with the Strang splitting scheme for fixed time step sizes between 10−8s and 10−6s.

Dense Matrix Operations Research Articles

Related Topics

Articles published on Dense Matrix Operations

Distributed out-of-memory NMF on CPU/GPU architectures

Unleashing the Potential of Sparse DNNs Through Synergistic Hardware-Sparsity Co-Design

Very fast finite element Poisson solvers on lower precision accelerator hardware: A proof of concept study for Nvidia Tesla V100

Towards electronic structure-based ab-initio molecular dynamics simulations with hundreds of millions of atoms

Tensorox: Accelerating GPU Applications via Neural Approximation on Unused Tensor Cores

The Second Competition on Spatial Statistics for Large Datasets

Multicolor low‐rank preconditioner for general sparse linear systems

GPU Acceleration of Dense Matrix and Block Operations for Lanczos Method for Systems Over GF(2)

Large-scale electromagnetic field analyses of coils wound with coated conductors using a current-vector-potential formulation with a thin-strip approximation

Network Coding in Heterogeneous Multicore IoT Nodes With DAG Scheduling of Parallel Matrix Block Operations

MARE2DEM: a 2-D inversion code for controlled-source electromagnetic and magnetotelluric data

A sparse stiff chemistry solver based on dynamic adaptive integration for efficient combustion simulations

A fast, memory efficient and robust sparse preconditioner based on a multifrontal approach with applications to finite‐element matrices

Implementation of a generalized exponential basis functions method for linear and non‐linear problems

Faster solvers for large kinetic mechanisms using adaptive preconditioners

The density matrix renormalization group algorithm on kilo-processor architectures: Implementation and trade-offs

Solving an elliptic PDE eigenvalue problem via automated multi-level substructuring and hierarchical matrices

The performance of GRAPE-DR for dense matrix operations

A High Performance Multifrontal Code for Linear Solution of Structures Using Multi-Core Microprocessors

An MIMD strategy for quantum mechanical reactive scattering calculations

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Dense Matrix Operations Research Articles

Related Topics

Articles published on Dense Matrix Operations

Distributed out-of-memory NMF on CPU/GPU architectures

Unleashing the Potential of Sparse DNNs Through Synergistic Hardware-Sparsity Co-Design

Very fast finite element Poisson solvers on lower precision accelerator hardware: A proof of concept study for Nvidia Tesla V100

Towards electronic structure-based ab-initio molecular dynamics simulations with hundreds of millions of atoms

Tensorox: Accelerating GPU Applications via Neural Approximation on Unused Tensor Cores

The Second Competition on Spatial Statistics for Large Datasets

Multicolor low‐rank preconditioner for general sparse linear systems

GPU Acceleration of Dense Matrix and Block Operations for Lanczos Method for Systems Over GF(2)

Large-scale electromagnetic field analyses of coils wound with coated conductors using a current-vector-potential formulation with a thin-strip approximation

Network Coding in Heterogeneous Multicore IoT Nodes With DAG Scheduling of Parallel Matrix Block Operations

MARE2DEM: a 2-D inversion code for controlled-source electromagnetic and magnetotelluric data

A sparse stiff chemistry solver based on dynamic adaptive integration for efficient combustion simulations

A fast, memory efficient and robust sparse preconditioner based on a multifrontal approach with applications to finite‐element matrices

Implementation of a generalized exponential basis functions method for linear and non‐linear problems

Faster solvers for large kinetic mechanisms using adaptive preconditioners

The density matrix renormalization group algorithm on kilo-processor architectures: Implementation and trade-offs

Solving an elliptic PDE eigenvalue problem via automated multi-level substructuring and hierarchical matrices

The performance of GRAPE-DR for dense matrix operations

A High Performance Multifrontal Code for Linear Solution of Structures Using Multi-Core Microprocessors

An MIMD strategy for quantum mechanical reactive scattering calculations