A new Graphics Processing Unit (GPU) parallelization strategy is proposed to accelerate sparse finite element computation for three-dimensional electromagnetic analysis. The strategy is built on a new compression format called sliced ELL Four (sliced ELL-F). It is designed to accelerate the many addition, dot product, and Sparse Matrix Vector Product (SMVP) operations in the Conjugate Gradient Norm (CGN) calculation of finite element equations, and the new GPU implementation of SMVP is evaluated. Executed on a GPU, the proposed strategy solves sparse finite element equations efficiently, especially when the equations are highly sparse (most rows of the coefficient matrix contain fewer than 8 entries). Numerical results show that the sliced ELL-F format-based parallelization strategy achieves significant speedups over the Compressed Sparse Row (CSR) format.
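The abstract does not describe the layout of sliced ELL-F, so the following is only a minimal sketch of the generic sliced ELLPACK idea that the format's name suggests, written in plain Python rather than as a GPU kernel. The function names, the `slice_height` parameter, and the zero-padding convention are all assumptions made for illustration and are not taken from the paper. Rows are grouped into slices, each slice is padded only to its own longest row, and entries are stored column-major within a slice so that consecutive rows (one GPU thread each, in a real kernel) would read consecutive memory locations.

```python
def csr_spmv(indptr, indices, data, x):
    """Reference SMVP y = A x with A stored in CSR."""
    return [
        sum(data[p] * x[indices[p]] for p in range(indptr[r], indptr[r + 1]))
        for r in range(len(indptr) - 1)
    ]


def csr_to_sliced_ell(indptr, indices, data, slice_height):
    """Convert CSR to a generic sliced ELLPACK layout (illustrative only).

    Each slice is a tuple (first_row, height, width, cols, vals), where
    cols and vals are stored column-major and padded with (0, 0.0).
    """
    n_rows = len(indptr) - 1
    slices = []
    for start in range(0, n_rows, slice_height):
        stop = min(start + slice_height, n_rows)
        height = stop - start
        # Pad only to the longest row inside this slice, not the whole matrix.
        width = max(indptr[r + 1] - indptr[r] for r in range(start, stop))
        cols, vals = [], []
        for k in range(width):              # k-th stored entry of each row
            for r in range(start, stop):    # consecutive rows are adjacent
                p = indptr[r] + k
                if p < indptr[r + 1]:
                    cols.append(indices[p])
                    vals.append(data[p])
                else:                       # padding entry contributes zero
                    cols.append(0)
                    vals.append(0.0)
        slices.append((start, height, width, cols, vals))
    return slices


def sliced_ell_spmv(slices, x):
    """SMVP over the sliced layout; the loop over i stands in for GPU threads."""
    n_rows = sum(height for _, height, _, _, _ in slices)
    y = [0.0] * n_rows
    for start, height, width, cols, vals in slices:
        for i in range(height):
            acc = 0.0
            for k in range(width):
                j = k * height + i          # column-major index within slice
                acc += vals[j] * x[cols[j]]
            y[start + i] = acc
    return y


# Small tridiagonal example: at most 3 entries per row.
indptr = [0, 2, 5, 8, 10]
indices = [0, 1, 0, 1, 2, 1, 2, 3, 2, 3]
data = [4.0, 1.0, 1.0, 4.0, 1.0, 1.0, 4.0, 1.0, 1.0, 4.0]
x = [1.0, 2.0, 3.0, 4.0]

y_csr = csr_spmv(indptr, indices, data, x)
y_ell = sliced_ell_spmv(csr_to_sliced_ell(indptr, indices, data, 2), x)
```

In this sketch both routines return the same vector; the difference lies only in memory layout. When most rows hold only a few entries, as in the case the abstract highlights, per-slice padding stays small, which is the usual motivation for ELL-type formats over CSR on GPUs.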