Single‐ and multi‐GPU computing on NVIDIA‐ and AMD‐based server platforms for solidification modeling application

Kamil Halbiniak,Norbert Meyer,Krzysztof Rojek

doi:10.1002/cpe.8000

Abstract

SummaryThis work explores the performance of single‐ and multi‐GPU computing on state‐of‐the‐art NVIDIA‐ and AMD‐based server‐class hardware using various programming interfaces to accelerate a real‐world scientific application for solidification modeling based on the phase‐field method. The main computations of this memory‐bound application correspond to 20 stencils computed across grid nodes. We investigate the application's scalability for two basic schemes of organizing computation: without and with hiding data transfers behind computation, combined with using either peer‐to‐peer inter‐GPU data transfers through NVIDIA NVLink and AMD Infinity interconnects or communication over the PCIe and main memory. Among the studied programming interfaces is CUDA, HIP, and OpenMP Accelerator Model. While the first two are designed to write the codes for a specific hardware platform, OpenMP enables code portability between NVIDIA and AMD GPUs. The resulting performance is experimentally assessed on computing platforms containing NVIDIA V100 (up to 8 GPUs) and A100 (one GPU), as well as AMD MI210 (one device) and MI250 (up to 8 logical GPUs).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Single‐ and multi‐GPU computing on NVIDIA‐ and AMD‐based server platforms for solidification modeling application

Abstract

Talk to us

Similar Papers

More From: Concurrency and Computation: Practice and Experience

Lead the way for us

Journal: Concurrency and Computation: Practice and Experience	Publication Date: Dec 27, 2023
Citations: 2

Similar Papers

Exploring the possibility of a hipSYCL-based implementation of oneAPI
Aksel Alpay ... Holger Wünsche
-
Aksel Alpay, et. al.Aksel Alpay ... Holger Wünsche
10 May 2022
10 May 2022

Viability Study of SYCL as a Unified Programming Model for Heterogeneous Systems Based on GPUs in Bioinformatics
Manuel Costanzo
Journal of Computer Science and Technology | VOL. 24
Manuel CostanzoManuel Costanzo
18 Oct 2024
Journal of Computer Science and Technology | VOL. 24

Boda
Matthew W Moskewicz ... Ali Jannesari
-
Matthew W Moskewicz, et. al.Matthew W Moskewicz ... Ali Jannesari
15 May 2017
15 May 2017

New capabilities of the Monte Carlo dose engine ARCHER-RT: Clinical validation of the Varian TrueBeam machine for VMAT external beam radiotherapy.
David P Adam ... Xie George Xu
Medical Physics | VOL. 47
David P Adam, et. al.David P Adam ... Xie George Xu
13 Apr 2020
Medical Physics | VOL. 47

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Single‐ and multi‐GPU computing on NVIDIA‐ and AMD‐based server platforms for solidification modeling application

Abstract

Talk to us

Similar Papers

More From: Concurrency and Computation: Practice and Experience