Intel Xeon Phi acceleration of Hybrid Total FETI solver

Michal Merta,Lubomir Riha,Ondrej Meca,Alexandros Markopoulos,Tomas Brzobohaty,Tomas Kozubek,Vit Vondrak

doi:10.1016/j.advengsoft.2017.05.001

Abstract

This paper describes an approach for acceleration of the Hybrid Total FETI (HTFETI) domain decomposition method using the Intel Xeon Phi coprocessors. The HTFETI method is a memory bound algorithm which uses sparse linear BLAS operations with irregular memory access pattern. The presented local Schur complement (LSC) method has regular memory access pattern, that allows the solver to fully utilize the Intel Xeon Phi fast memory bandwidth.This translates to speedup over 10.9 of the HTFETI iterative solver when solving 3 billion unknown heat transfer problem (3D Laplace equation) on almost 400 compute nodes. The comparison is between the CPU computation using sparse data structures (PARDISO sparse direct solver) and the LSC computation on Xeon Phi. In the case of the structural mechanics problem (3D linear elasticity) of size 1 billion DOFs the respective speedup is 3.4.The presented speedups are asymptotic and they are reached for problems requiring high number of iterations (e.g., ill-conditioned problems, transient problems, contact problems). For problems which can be solved with under hundred iterations the local Schur complement method is not optimal. For these cases we have implemented sparse matrix processing using PARDISO also for the Xeon Phi accelerators.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Intel Xeon Phi acceleration of Hybrid Total FETI solver

Abstract

Talk to us

Similar Papers

More From: Advances in Engineering Software

Lead the way for us

Journal: Advances in Engineering Software	Publication Date: May 10, 2017
Citations: 11

Similar Papers

Efficient Strategies of Compressing Three-Dimensional Sparse Arrays Based on Intel XEON and Intel XEON Phi Environments
Chun-Yuan Lin ... Che-Lun Hung
-
Chun-Yuan Lin, et. al.Chun-Yuan Lin ... Che-Lun Hung
01 Oct 2015
01 Oct 2015

On the Mitigation of Cache Hostile Memory Access Patterns on Many-Core CPU Architectures
Tom Deakin ... Simon Mcintosh-Smith
-
Tom Deakin, et. al.Tom Deakin ... Simon Mcintosh-Smith
01 Jan 2017
01 Jan 2017

AstroPhi: A code for complex simulation of the dynamics of astrophysical objects using hybrid supercomputers
I.M Kulikov ... A.V Tutukov
Computer Physics Communications | VOL. 186
I.M Kulikov, et. al.I.M Kulikov ... A.V Tutukov
17 Sep 2014
Computer Physics Communications | VOL. 186

Parallelized Simulation of a Finite Element Method in Many Integrated Core Architecture
Moonho Tak ... Taehyo Park
Journal of Engineering Materials and Technology | VOL. 139
Moonho Tak, et. al.Moonho Tak ... Taehyo Park
07 Feb 2017
Journal of Engineering Materials and Technology | VOL. 139

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Intel Xeon Phi acceleration of Hybrid Total FETI solver

Abstract

Talk to us

Similar Papers

More From: Advances in Engineering Software