Hybrid OpenMP/AVX Acceleration of a Higher Order Quiet Direct Simulation Method for the Euler Equations

Matthew R Smith,Ji-Yueh Liu,Fang-An Kuo,Jong-Shin Wu

doi:10.1016/j.proeng.2013.07.108

Abstract

Presented is the Quiet Direct Simulation (QDS) applied to parallel computation using a hybrid OpenMP/AVX parallelization paradigm. Due to the high locality of the QDS scheme, the method has been successfully applied to parallel computation using Graphics Processing Units (GPU) – we show here that the same principles which allow high performance on GPU devices also permit high performance when using Advanced Vector eXtensions (AVX). Furthermore, since modern CPU's employ a large number of cores, we can further extend the performance by using AVX on each available CPU core using shared memory (OpenMP) parallelization. We present a simple direction- split higher order extension to the QDS method, and then apply it to AVX through the use of intrinsic functions in the flux computation and state computation modules. High performance is obtained by ensuring that all flux computations are performed using only AVX intrinsic functions – no computations are performed in serial. Through this approach, a single workstation with 2x Xeon CPU's (16 physical cores) allows a performance increase of over 177 times that of a single core alone. We also demonstrate that built-in optimization does not fully exploit AVX parallelization through the examination of assembly code.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Procedia Engineering	Publication Date: Jan 1, 2013
Citations: 1	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

Hybrid OpenMP/AVX Acceleration of a Higher Order Quiet Direct Simulation Method for the Euler Equations

Abstract

Talk to us

Similar Papers

More From: Procedia Engineering

Lead the way for us

Similar Papers

Hybrid OpenMP/AVX acceleration of a Split HLL Finite Volume Method for the Shallow Water and Euler Equations
Ji-Yueh Liu ... Jong-Shin Wu
Computers & Fluids | VOL. 110
Ji-Yueh Liu, et. al.Ji-Yueh Liu ... Jong-Shin Wu
18 Dec 2014
Computers & Fluids | VOL. 110

The implementation of the three-dimensional unified gas-kinetic wave-particle method on multiple graphics processing units
Guochao Fan ... Shaobo Yao
Physics of Fluids | VOL. 35
Guochao Fan, et. al.Guochao Fan ... Shaobo Yao
01 Aug 2023
Physics of Fluids | VOL. 35

A High Throughput Efficient Approach for Decoding LDPC Codes onto GPU Devices
Bertrand Le Gal ... Christophe Jego
IEEE Embedded Systems Letters | VOL. 6
Bertrand Le Gal, et. al.Bertrand Le Gal ... Christophe Jego
01 Jun 2014
IEEE Embedded Systems Letters | VOL. 6

Accelerating a Geometrical Approximated PCA Algorithm Using AVX2 and CUDA
Alina Machidon ... Petre Ogrutan
Remote Sensing | VOL. 12
Alina Machidon, et. al.Alina Machidon ... Petre Ogrutan
13 Jun 2020
Remote Sensing | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hybrid OpenMP/AVX Acceleration of a Higher Order Quiet Direct Simulation Method for the Euler Equations

Abstract

Talk to us

Similar Papers

More From: Procedia Engineering