A parallel MPI + OpenMP + OpenCL algorithm for hybrid supercomputations of incompressible flows

A.V Gorobets,F.X Trias,A Oliva

doi:10.1016/j.compfluid.2013.05.021

Abstract

The work is devoted to the development of efficient parallel algorithms for large-scale simulations of incompressible flows on hybrid supercomputers based on massively-parallel accelerators. The governing equations are discretized using a high-order finite-volume scheme for Cartesian staggered meshes with the only restriction that, at least, one direction is periodic. Its “classical” MPI+OpenMP parallel implementation for CPUs was designed to scale till 100,000 CPU cores. The new hybrid algorithm is developed on a base of a multi-level parallel model that exploits several layers of parallelism of a modern hybrid supercomputer. In this model, MPI and OpenMP are used on the first two levels to couple nodes of a supercomputer and to engage its CPU cores. Then, computing accelerators are further used by means of the hardware independent OpenCL computing standard. In this way, the implementation is adapted to a general computing model with central processors and math co-processors. In this paper the work is focused on adapting the basic operations of the algorithm to architectures of Graphics Processing Units (GPU) without considering the multi-GPU communication scheme. Technology of porting the code to OpenCL is described, certain optimization approaches are presented and relevant performance results obtaining up to 80–90 GFLOPS on a GPU accelerator are demonstrated.Moreover, the experience with different GPU architectures is summarized and a comparison based on the particular application is given for AMD and NVIDIA GPUs as well as for CUDA and OpenCL frameworks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A parallel MPI + OpenMP + OpenCL algorithm for hybrid supercomputations of incompressible flows

Abstract

Talk to us

Similar Papers

More From: Computers & Fluids

Lead the way for us

Journal: Computers & Fluids	Publication Date: Jun 13, 2013
Citations: 28

Similar Papers

Portability for GPU-accelerated molecular docking applications for cloud and HPC: can portable compiler directives provide performance across all platforms?
Mathialakan Thavappiragasam ... Wael Elwasif
-
Mathialakan Thavappiragasam, et. al.Mathialakan Thavappiragasam ... Wael Elwasif
01 May 2022
01 May 2022

Optimized OpenCL implementation of the Elastodynamic Finite Integration Technique for viscoelastic media
M Molero-Armenta ... M.G Hernández
Computer Physics Communications | VOL. 185
M Molero-Armenta, et. al.M Molero-Armenta ... M.G Hernández
28 May 2014
Computer Physics Communications | VOL. 185

Parallel hyperbolic PDE simulation on clusters: Cell versus GPU
Scott Rostrup ... Hans De Sterck
Computer Physics Communications | VOL. 181
Scott Rostrup, et. al.Scott Rostrup ... Hans De Sterck
26 Aug 2010
Computer Physics Communications | VOL. 181

Mars: Accelerating MapReduce with Graphics Processors
Wenbin Fang ... Naga K Govindaraju
IEEE Transactions on Parallel and Distributed Systems | VOL. 22
Wenbin Fang, et. al.Wenbin Fang ... Naga K Govindaraju
01 Apr 2011
IEEE Transactions on Parallel and Distributed Systems | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A parallel MPI + OpenMP + OpenCL algorithm for hybrid supercomputations of incompressible flows

Abstract

Talk to us

Similar Papers

More From: Computers &amp; Fluids

More From: Computers & Fluids