Mesh–particle interpolations on graphics processing units and multicore central processing units

Diego Rossinelli, Petros Koumoutsakos, Christian Conti

doi:10.1098/rsta.2011.0074

Abstract

Particle-mesh interpolations are fundamental operations for particle-in-cell codes, as implemented in vortex methods, plasma dynamics and electrostatics simulations. In these simulations, the mesh is used to solve the field equations and the gradients of the fields are used in order to advance the particles. The time integration of particle trajectories is performed through an extensive resampling of the flow field at the particle locations. The computational performance of this resampling turns out to be limited by the memory bandwidth of the underlying computer architecture. We investigate how mesh-particle interpolation can be efficiently performed on graphics processing units (GPUs) and multicore central processing units (CPUs), and we present two implementation techniques. The single-precision results for the multicore CPU implementation show an acceleration of 45-70×, depending on system size, and an acceleration of 85-155× for the GPU implementation over an efficient single-threaded C++ implementation. In double precision, we observe a performance improvement of 30-40× for the multicore CPU implementation and 20-45× for the GPU implementation. With respect to the 16-threaded standard C++ implementation, the present CPU technique leads to a performance increase of roughly 2.8-3.7× in single precision and 1.7-2.4× in double precision, whereas the GPU technique leads to an improvement of 9× in single precision and 2.2-2.8× in double precision.

Journal: Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences
Publication Date: Jun 13, 2011
Citations: 16

Similar Papers

Reduction of computing time for seismic applications based on the Helmholtz equation by Graphics Processing Units
03 Mar 2015

Dynamic Heterogeneous scheduling of GPU-CPU in Distributed Environment
Suman Goyat ... Shri Kant
01 Nov 2019

Efficient Utilization of a CPU-GPU Cluster
Gopal Patnaik ... Keith Obenschain
09 Jan 2012

Parallelizing Multiple Flow Accumulation Algorithm using CUDA and OpenACC
Natalija Stojanovic ... Dragan Stojanovic
ISPRS International Journal of Geo-Information | VOL. 8
03 Sep 2019
