GPU acceleration of an iterative scheme for gas-kinetic model equations with memory reduction techniques

Lianhua Zhu,Peng Wang,Songze Chen,Zhaoli Guo,Yonghao Zhang

doi:10.1016/j.cpc.2019.106861

Abstract

This paper presents a Graphics Processing Unit (GPU) acceleration of an iteration-based discrete velocity method (DVM) for gas-kinetic model equations. Unlike the previous GPU parallelization of explicit kinetic schemes, this work is based on a fast converging iterative scheme. The memory reduction techniques previously proposed for DVM are applied for GPU computing, enabling full three-dimensional (3D) solutions of kinetic model equations in the contemporary GPUs usually with a limited memory capacity that otherwise would need terabytes of memory. The GPU algorithm is validated against the direct simulation Monte Carlo (DSMC) simulation of the 3D lid-driven cavity flow and the supersonic rarefied gas flow past a cube with the phase-space grid points up to 0.7 trillion. The computing performance profiling on three models of GPUs shows that the two main kernel functions can utilize 56%∼79% of the GPU computing and memory resources. The performance of the GPU algorithm is compared with a typical parallel CPU implementation of the same algorithm using the Message Passing Interface (MPI). The comparison shows that the GPU program on K40 and K80 achieves 1.2∼2.8 and 1.2∼2.4 speedups for the 3D lid-driven cavity flow, respectively, compared with the MPI parallelized CPU program running on 96 CPU cores.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

GPU acceleration of an iterative scheme for gas-kinetic model equations with memory reduction techniques

Abstract

Talk to us

Similar Papers

More From: Computer Physics Communications

Lead the way for us

Journal: Computer Physics Communications	Publication Date: Aug 14, 2019
Citations: 15

Similar Papers

GPU-HADVPPM V1.0: a high-efficiency parallel GPU design of the piecewise parabolic method (PPM) for horizontal advection in an air quality model (CAMx V6.10)
Kai Cao ... Nan Wang
Geoscientific Model Development | VOL. 16
Kai Cao, et. al.Kai Cao ... Nan Wang
01 Aug 2023
Geoscientific Model Development | VOL. 16

Parallel hyperbolic PDE simulation on clusters: Cell versus GPU
Scott Rostrup ... Hans De Sterck
Computer Physics Communications | VOL. 181
Scott Rostrup, et. al.Scott Rostrup ... Hans De Sterck
26 Aug 2010
Computer Physics Communications | VOL. 181

Ballooning Graphics Memory Space in Full GPU Virtualization Environments
Younghun Park ... Sungyong Park
Scientific Programming | VOL. 2019
Younghun Park, et. al.Younghun Park ... Sungyong Park
23 Apr 2019
Scientific Programming | VOL. 2019

Graphics processing unit (GPU) programming strategies and trends in GPU computing
André R Brodtkorb ... Martin L Sætra
Journal of Parallel and Distributed Computing | VOL. 73
André R Brodtkorb, et. al.André R Brodtkorb ... Martin L Sætra
04 May 2012
Journal of Parallel and Distributed Computing | VOL. 73

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

GPU acceleration of an iterative scheme for gas-kinetic model equations with memory reduction techniques

Abstract

Talk to us

Similar Papers

More From: Computer Physics Communications