Abstract
A double-GPU code is developed to accelerate WENO schemes. The test problem is a compressible viscous flow. The convective terms are discretized using third- to ninth-order WENO schemes and the viscous terms are discretized by the standard fourth-order central scheme. The code written in CUDA programming language is developed by modifying a single-GPU code. The OpenMP library is used for parallel execution of the code on both the GPUs. Data transfer between GPUs which is the main issue in developing the code, is carried out by defining halo points for numerical grids and by using a CUDA built-in function. The code is executed on a PC equipped with two heterogeneous GPUs. The computational times of different schemes are obtained and the speedups with respect to the single-GPU code are reported for different number of grid points. Furthermore, the developed code is analyzed by CUDA profiling tools. The analyze helps to further increase the code performance.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.