Heterogeneous CPU+GPU parallelization for high-accuracy scale-resolving simulations of compressible turbulent flows on hybrid supercomputers

Andrey Gorobets,Pavel Bakhvalov

doi:10.1016/j.cpc.2021.108231

Abstract

A heterogeneous parallel algorithm for simulation of compressible turbulent flows and its portable software implementation are presented. The underlying numerical method is based on a family of higher accuracy edge-based reconstruction schemes on unstructured mixed-element meshes. The proposed parallel solution can engage a large number of computing devices of most of the existing computing architectures used in modern supercomputers, including manycore CPUs and GPUs. It is capable of co-execution on both CPUs and accelerators simultaneously. The multilevel parallel algorithm combines: MPI for distributing workload among hybrid cluster nodes and between devices inside nodes; OpenMP for manycore CPUs and other supporting devices, such as Intel Xeon Phi; OpenCL for massively-parallel accelerators, such as GPUs of various vendors, including NVIDIA, AMD, Intel. The main focus is on the adaptation of the numerical method and its computational algorithm to the stream processing parallel paradigm. The very limited device memory inherent in GPU computing is also taken into account. A detailed description of the parallel algorithm is presented, as well as the techniques used for its efficient parallel implementation. Special attention is paid to implicit time integration with its linear solver and calculation of convective fluxes and viscous terms. The use of mixed floating-point precision and overlapping communications and computations is also discussed. Parallel performance is demonstrated in practical applications on different kinds of supercomputers using up to 10 thousand cores and multiple GPUs of comparable overall performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Heterogeneous CPU+GPU parallelization for high-accuracy scale-resolving simulations of compressible turbulent flows on hybrid supercomputers

Abstract

Talk to us

Similar Papers

More From: Computer Physics Communications

Lead the way for us

Journal: Computer Physics Communications	Publication Date: Nov 16, 2021
Citations: 30

Similar Papers

On the use of the discontinuous Galerkin method for numerical simulation of two-dimensional compressible turbulence with shocks
Jian Yu ... Zhenhua Jiang
Science China Physics, Mechanics & Astronomy | VOL. 57
Jian Yu, et. al.Jian Yu ... Zhenhua Jiang
11 Jun 2014
Science China Physics, Mechanics & Astronomy | VOL. 57

Helical model based on artificial neural network for large eddy simulation of compressible wall-bounded turbulent flows
Wanhai Liu ... Changping Yu
Physics of Fluids | VOL. 35
Wanhai Liu, et. al.Wanhai Liu ... Changping Yu
01 Apr 2023
Physics of Fluids | VOL. 35

Large stencil viscous flux linearization for the simulation of 3D compressible turbulent flows with backward-Euler schemes
Jacques Peter ... Frédérique Drullion
Computers & Fluids | VOL. 36
Jacques Peter, et. al.Jacques Peter ... Frédérique Drullion
26 Jan 2007
Computers & Fluids | VOL. 36

GPU‐accelerated direct numerical simulations of decaying compressible turbulence employing a GKM‐based solver
Nishant Parashar ... Balaji Srinivasan
International Journal for Numerical Methods in Fluids | VOL. 83
Nishant Parashar, et. al.Nishant Parashar ... Balaji Srinivasan
08 Sep 2016
International Journal for Numerical Methods in Fluids | VOL. 83

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Heterogeneous CPU+GPU parallelization for high-accuracy scale-resolving simulations of compressible turbulent flows on hybrid supercomputers

Abstract

Talk to us

Similar Papers

More From: Computer Physics Communications