Performance of a three-dimensional unstructured mesh compressible flow solver on NVIDIA Fermi-class graphics processing unit hardware

Jacob Waltz

doi:10.1002/fld.3744

Abstract

We describe the performance of Chicoma, a 3D unstructured mesh compressible flow solver, on graphics processing unit (GPU) hardware. The approach used to deploy the solver on GPU architectures derives from the threaded multicore execution model used in Chicoma, and attempts to improve memory performance via the application of graph theory techniques. The result is a scheme that can be deployed on the GPU with high-level programming constructs, for example, compiler directives, rather than low-level programming extensions. With an NVIDIA Fermi-class GPU (NVIDIA Corp., Santa Clara, CA, USA) and double precision floating point arithmetic, we observe performance gains of 4–5× on problem sizes of 10^6–10^7 tetrahedra. We also compare GPU performance to threaded multicore performance with OpenMP and demonstrate hybrid multicore-GPU calculations with adaptive mesh refinement. Published 2012. This article is a US Government work and is in the public domain in the USA.

Journal: International Journal for Numerical Methods in Fluids
Publication Date: Oct 18, 2012
Citations: 14
