Abstract

We show that typical parallelization strategies for CPU implementations can partly be reused to develop efficient GPU implementations, and we point out what should be considered when comparing a CPU with a GPU implementation. To this end we focus on our main concern, the parallelization of a class of lattice group models (LGpMs) for the Boltzmann equation. We give a short overview of the mathematical approach behind these models and then compare the CPU with the GPU architecture to establish the basis for the applied parallelization strategies, without going into detail about the underlying C implementation. Finally, we use the speedups achieved by our LGpM parallelization on GPUs and CPUs to compare the two architectures in terms of computational performance relative to their initial costs.
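To make the "one thread per lattice site" pattern alluded to above concrete, the following is a minimal CUDA sketch of a data-parallel relaxation sweep over a 2D lattice. It is not the paper's implementation: the grid dimensions NX and NY, the relaxation parameter OMEGA, the constant equilibrium value, and the kernel name relax_sites are all illustrative assumptions; the actual LGpM discretization and kernels differ.

```cuda
#include <cuda_runtime.h>
#include <stdio.h>

/* Illustrative lattice size and relaxation parameter (assumptions,
 * not taken from the paper). */
#define NX 256
#define NY 256
#define OMEGA 1.0f

/* One thread per lattice site: each thread relaxes its local value
 * towards a (here trivially constant) equilibrium. This is the same
 * data-parallel pattern obtained by parallelizing the site loop on a CPU. */
__global__ void relax_sites(const float *f_in, float *f_out, float f_eq)
{
    int x = blockIdx.x * blockDim.x + threadIdx.x;
    int y = blockIdx.y * blockDim.y + threadIdx.y;
    if (x >= NX || y >= NY) return;

    int idx = y * NX + x;
    /* BGK-style relaxation: f <- f + omega * (f_eq - f) */
    f_out[idx] = f_in[idx] + OMEGA * (f_eq - f_in[idx]);
}

int main(void)
{
    size_t bytes = (size_t)NX * NY * sizeof(float);
    float *d_in, *d_out;
    cudaMalloc(&d_in, bytes);
    cudaMalloc(&d_out, bytes);
    cudaMemset(d_in, 0, bytes);

    /* 16x16 thread blocks covering the whole lattice. */
    dim3 block(16, 16);
    dim3 grid((NX + block.x - 1) / block.x, (NY + block.y - 1) / block.y);
    relax_sites<<<grid, block>>>(d_in, d_out, 1.0f);
    cudaDeviceSynchronize();

    cudaFree(d_in);
    cudaFree(d_out);
    return 0;
}
```

The point of the sketch is only that the CPU-style decomposition of the lattice into independent site updates maps directly onto GPU threads; memory layout, streaming steps, and boundary handling, which the paper addresses, are omitted here.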
