Performance analysis of hybrid parallel solver for 3D Stokes equation on Intel Xeon computer system

M Ganzha, M Paprzycki, I Lirkov

doi:10.1063/1.5130863

Abstract

In our previous work, we studied the performance of a parallel program, based on a direction-splitting approach, that solves the time-dependent Stokes equation. That work used a rectangular uniform mesh combined with a central difference scheme for the second derivatives. We targeted massively parallel computers as well as clusters of multi-core nodes, so the developed implementation uses hybrid parallelization based on the MPI and OpenMP standards. Specifically, (i) between-node parallelism is supported by MPI-based communication, while (ii) inside-node parallelism is supported by OpenMP. By matching the “structure of parallelization” to the architecture of modern large-scale computers, we attempted to maximize the parallel efficiency of the program. This paper presents an experimental performance study of the developed parallel implementation on a supercomputer using Intel Xeon processors as well as Intel Xeon Phi co-processors. The experimental results show a substantial improvement when running experiments for a variety of problem sizes and numbers of cores/threads.
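The abstract does not give the discrete equations, so the following is only a minimal sketch of the kind of one-dimensional implicit sub-step that direction-splitting schemes typically reduce to. It assumes a uniform mesh with spacing `h`, a time step `tau`, zero Dirichlet boundary values, and the standard three-point central difference for the second derivative. All function names are hypothetical and are not taken from the authors' code.

```python
def solve_tridiagonal(lower, diag, upper, rhs):
    """Thomas algorithm for a tridiagonal system (no pivoting).

    lower[0] and upper[-1] are never read.
    """
    n = len(diag)
    c = [0.0] * n  # modified upper diagonal
    d = [0.0] * n  # modified right-hand side
    c[0] = upper[0] / diag[0]
    d[0] = rhs[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - lower[i] * c[i - 1]
        c[i] = upper[i] / denom
        d[i] = (rhs[i] - lower[i] * d[i - 1]) / denom
    x = [0.0] * n
    x[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d[i] - c[i] * x[i + 1]
    return x


def implicit_sweep_1d(f, h, tau):
    """Solve (I - tau * D2) u = f along one mesh line.

    D2 is the central-difference second derivative on a uniform mesh,
    with zero Dirichlet values assumed outside the interior points.
    """
    n = len(f)
    r = tau / (h * h)
    return solve_tridiagonal([-r] * n, [1.0 + 2.0 * r] * n, [-r] * n, f)


def sweep_all_lines(lines, h, tau):
    """Apply the 1D solve to every mesh line of one direction.

    The lines are independent of each other, which is the property a
    hybrid code could exploit (threads over lines within a node).
    """
    return [implicit_sweep_1d(line, h, tau) for line in lines]


def apply_operator(u, h, tau):
    """Apply (I - tau * D2) to u; used only to check the solve."""
    n = len(u)
    r = tau / (h * h)
    out = []
    for i in range(n):
        left = u[i - 1] if i > 0 else 0.0
        right = u[i + 1] if i < n - 1 else 0.0
        out.append(u[i] - r * (left - 2.0 * u[i] + right))
    return out
```

In a sketch like this, the loop in `sweep_all_lines` is where thread-level parallelism would naturally go, because each line is solved independently. How the mesh is distributed between nodes, and what is communicated, is not described in the abstract and is not modeled here.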
