KokkACC: Enhancing Kokkos with OpenACC

Pedro Valero-Lara, Marc Gonzalez-Tallada, Joel Denny, Seyong Lee, Jeffrey S Vetter

doi:10.1109/waccpd56842.2022.00009

Abstract

Template metaprogramming is gaining popularity as a high-level solution for achieving performance portability on heterogeneous computing resources. Kokkos is a representative approach that offers programmers high-level abstractions for generic programming while most of the device-specific code generation and optimizations are delegated to the compiler through template specializations. For this, Kokkos provides a set of device-specific code specializations in multiple back ends, such as CUDA and HIP. Unlike CUDA or HIP, OpenACC is a high-level and directive-based programming model. This descriptive model allows developers to insert hints (pragmas) into their code that help the compiler to parallelize the code. The compiler is responsible for the transformation of the code, which is completely transparent to the programmer. This paper presents an OpenACC back end for Kokkos: KokkACC. As an alternative to Kokkos’s existing device-specific back ends, KokkACC is a multi-architecture back end providing a high-productivity programming environment enabled by OpenACC’s high-level and descriptive programming model. Moreover, we have observed competitive performance; in some cases, KokkACC is faster (up to 9×) than NVIDIA’s CUDA back end and much faster than OpenMP’s GPU offloading back end. This work also includes implementation details and a detailed performance study conducted with a set of mini-benchmarks (AXPY and DOT product) and three mini-apps (LULESH, miniFE and SNAP, a LAMMPS proxy mini-app).
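To make the directive-based style concrete, the two mini-benchmarks named in the abstract (AXPY and DOT product) can be sketched in plain OpenACC as shown here. This is only an illustration of how pragmas annotate ordinary loops; it is not the KokkACC back end and does not use the Kokkos API. The function names `axpy` and `dot` and the choice of data clauses are assumptions made for this example.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper: AXPY, y[i] = a * x[i] + y[i].
// The pragma is a hint; a compiler without OpenACC support ignores it
// and runs the loop sequentially with the same result.
void axpy(double a, const std::vector<double>& x, std::vector<double>& y) {
    assert(x.size() == y.size());
    const std::size_t n = x.size();
    const double* xp = x.data();
    double* yp = y.data();
    // copyin: x is only read on the device; copy: y is read and written back.
    #pragma acc parallel loop copyin(xp[0:n]) copy(yp[0:n])
    for (std::size_t i = 0; i < n; ++i) {
        yp[i] = a * xp[i] + yp[i];
    }
}

// Hypothetical helper: DOT product, sum over i of x[i] * y[i].
double dot(const std::vector<double>& x, const std::vector<double>& y) {
    assert(x.size() == y.size());
    const std::size_t n = x.size();
    const double* xp = x.data();
    const double* yp = y.data();
    double sum = 0.0;
    // The reduction clause asks the compiler to combine per-thread partial sums.
    #pragma acc parallel loop reduction(+:sum) copyin(xp[0:n], yp[0:n])
    for (std::size_t i = 0; i < n; ++i) {
        sum += xp[i] * yp[i];
    }
    return sum;
}
```

Calling `axpy(2.0, x, y)` followed by `dot(x, y)` on two equally sized vectors exercises both kernels. The loop bodies contain no device-specific code, which is the property the abstract attributes to OpenACC's descriptive model: the compiler decides how to map the loop onto the target hardware.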

Similar Papers

OpenMP to CUDA graphs
Chenle Yu ... Eduardo Quiñones
25 May 2020

ClusterGOP: A high-level parallel programming environment
15 Aug 2004

Evaluating directive-based programming models on Wave Propagation Kernels
S Rodriguez
12 Jun 2017

A Multi-Level Platform-Independent GPU API for High-Level Programming Models
Akihiro Hayashi ... Vivek Sarkar
01 Jan 2021
