Performance Optimization of 3D Lattice Boltzmann Flow Solver on a GPU

Nhat-Phuong Tran,Sugwon Hong,Myungho Lee

doi:10.1155/2017/1205892

Abstract

Lattice Boltzmann Method (LBM) is a powerful numerical simulation method of the fluid flow. With its data parallel nature, it is a promising candidate for a parallel implementation on a GPU. The LBM, however, is heavily data intensive and memory bound. In particular, moving the data to the adjacent cells in the streaming computation phase incurs a lot of uncoalesced accesses on the GPU which affects the overall performance. Furthermore, the main computation kernels of the LBM use a large number of registers per thread which limits the thread parallelism available at the run time due to the fixed number of registers on the GPU. In this paper, we develop high performance parallelization of the LBM on a GPU by minimizing the overheads associated with the uncoalesced memory accesses while improving the cache locality using the tiling optimization with the data layout change. Furthermore, we aggressively reduce the register uses for the LBM kernels in order to increase the run-time thread parallelism. Experimental results on the Nvidia Tesla K20 GPU show that our approach delivers impressive throughput performance: 1210.63 Million Lattice Updates Per Second (MLUPS).

Highlights

Lattice Boltzmann Method (LBM) is a powerful numerical simulation method of the fluid flow, originating from the lattice gas automata methods [1]
We show the performance improvements of our optimization techniques compared with the previous approach based on the Structure of arrays (SoA) Pull implementation: (i) Figure 16 compares the average performance of the SoA with and without removing the branch divergences explained in Section 4.4 in the kernel code
In order to improve the cache locality and minimize the overheads associated with the uncoalesced accesses in moving the data to the adjacent cells in the streaming phase of the LBM, we used the tiling optimization with the data layout change

Summary

Introduction

Lattice Boltzmann Method (LBM) is a powerful numerical simulation method of the fluid flow, originating from the lattice gas automata methods [1]. LBM models the fluid flow consisting of particles moving with random motions. Such particles exchange the momentum and the energy through the streaming and the collision processes over the discrete lattice grid in the discrete time steps. The architecture of the GPU has gone through a number of innovative design changes in the last decade. It is integrated with a large number of cores and multiple threads per core, levels of the cache hierarchies, and the large amount (>5 GB) of the on-board memory. The advanced GPU architecture and the flexible programming environments have made possible innovative performance improvements in many application areas

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific programming	Publication Date: Jan 1, 2017
Citations: 22	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Performance Optimization of 3D Lattice Boltzmann Flow Solver on a GPU

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific programming

Lead the way for us

Similar Papers

Memory-Efficient Parallelization of 3D Lattice Boltzmann Flow Solver on a GPU
Nhat-Phuong Tran ... Myungho Lee
-
Nhat-Phuong Tran, et. al.Nhat-Phuong Tran ... Myungho Lee
01 Dec 2015
01 Dec 2015

Lattice Boltzmann methods for single-phase and solid-liquid phase-change heat transfer in porous media: A review
Ya-Ling He ... Wen-Quan Tao
International Journal of Heat and Mass Transfer | VOL. 129
Ya-Ling He, et. al.Ya-Ling He ... Wen-Quan Tao
27 Sep 2018
International Journal of Heat and Mass Transfer | VOL. 129

Application of Coupled Lattice Boltzmann and Phase-Field Methods for Multiphase Flow Simulations
Kannan N Premnath ... D V Patil
-
Kannan N Premnath, et. al.Kannan N Premnath ... D V Patil
14 Jul 2013
14 Jul 2013

Lattice Boltzmann method on quadtree grids for simulating fluid flow through porous media: A new automatic algorithm
Sajjad Foroughi ... Mohsen Masihi
Physica D: Nonlinear Phenomena | VOL. 392
Sajjad Foroughi, et. al.Sajjad Foroughi ... Mohsen Masihi
05 Jun 2013
Physica D: Nonlinear Phenomena | VOL. 392

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Performance Optimization of 3D Lattice Boltzmann Flow Solver on a GPU

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific programming