Performance of a Code Migration for the Simulation of Supersonic Ejector Flow to SMP, MIC, and GPU Using OpenMP, OpenMP+LEO, and OpenACC Directives

C Couder-Castañeda,I Gitler,M Arroyo,H Barrios-Piña

doi:10.1155/2015/739107

Abstract

A serial source code for simulating a supersonic ejector flow is accelerated using parallelization based on OpenMP and OpenACC directives. The purpose is to reduce the development costs and to simplify the maintenance of the application due to the complexity of the FORTRAN source code. This research follows well-proven strategies in order to obtain the best performance in both OpenMP and OpenACC. OpenMP has become the programming standard for scientific multicore software and OpenACC is one true alternative for graphics accelerators without the need of programming low level kernels. The strategies using OpenMP are oriented towards reducing the creation of parallel regions, tasks creation to handle boundary conditions, and a nested control of the loop time for the programming in offload mode specifically for the Xeon Phi. In OpenACC, the strategy focuses on maintaining the data regions among the executions of the kernels. Experiments for performance and validation are conducted here on a 12-core Xeon CPU, Xeon Phi 5110p, and Tesla C2070, obtaining the best performance from the latter. The Tesla C2070 presented an acceleration factor of 9.86X, 1.6X, and 4.5X compared against the serial version on CPU, 12-core Xeon CPU, and Xeon Phi, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Programming	Publication Date: Jan 1, 2015
Citations: 32	License type: CC BY 3.0

R Discovery Prime

R Discovery Prime

Performance of a Code Migration for the Simulation of Supersonic Ejector Flow to SMP, MIC, and GPU Using OpenMP, OpenMP+LEO, and OpenACC Directives

Abstract

Talk to us

Similar Papers

More From: Scientific Programming

Lead the way for us

Similar Papers

Intel Many Integrated Core (MIC) architecture optimization strategies for a memory-bound Weather Research and Forecasting (WRF) Goddard microphysics scheme
Allen H Huang ... Bormin Huang
-
Allen H Huang, et. al.Allen H Huang ... Bormin Huang
21 Oct 2014
21 Oct 2014

Performance tuning Weather Research and Forecasting (WRF) Goddard longwave radiative transfer scheme on Intel Xeon Phi
Allen H Huang ... Bormin Huang
-
Allen H Huang, et. al.Allen H Huang ... Bormin Huang
20 Oct 2015
20 Oct 2015

Optimizing the updated Goddard shortwave radiation Weather Research and Forecasting (WRF) scheme for Intel Many Integrated Core (MIC) architecture
Allen H.-L Huang ... Bormin Huang
-
Allen H.-L Huang, et. al.Allen H.-L Huang ... Bormin Huang
21 May 2015
21 May 2015

Revisiting Intel Xeon Phi optimization of Thompson cloud microphysics scheme in Weather Research and Forecasting (WRF) model
Allen Huang ... Bormin Huang
-
Allen Huang, et. al.Allen Huang ... Bormin Huang
20 Oct 2015
20 Oct 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Performance of a Code Migration for the Simulation of Supersonic Ejector Flow to SMP, MIC, and GPU Using OpenMP, OpenMP+LEO, and OpenACC Directives

Abstract

Talk to us

Similar Papers

More From: Scientific Programming