Abstract

With discrete Intel GPUs entering the high-performance computing landscape, there is an urgent need for production-ready software stacks for these platforms. In this article, we report how we enable the Ginkgo math library to execute on Intel GPUs by developing a kernel backend based on the DPC++ programming environment. We discuss conceptual differences between the CUDA and DPC++ programming models and describe workflows for simplified code conversion. We evaluate the performance of basic and advanced sparse linear algebra routines available in Ginkgo's DPC++ backend against the hardware-specific performance bounds and compare against routines providing the same functionality that ship with Intel's oneMKL vendor library.
