Achieving High Performance on Supercomputers with a Sequential Task-based Programming Model

Emmanuel Agullo, Olivier Aumage, Nathalie Furmento, Florent Pruvost, Marc Sergent, Mathieu Faverge, Samuel Paul Thibault

doi:10.1109/tpds.2017.2766064

Abstract

The emergence of accelerators as standard computing resources on supercomputers, and the resulting increase in architectural complexity, have revived the need for high-level parallel programming paradigms. The sequential task-based programming model has been shown to efficiently meet this challenge on a single multicore node, possibly enhanced with accelerators, which motivated its support in the OpenMP 4.0 standard. In this paper, we show that this paradigm can also be employed to achieve high performance on modern supercomputers composed of multiple such nodes, with extremely limited changes in the user code. To prove this claim, we have extended the StarPU runtime system with an advanced inter-node data management layer that supports this model by posting communications automatically. We illustrate our discussion with the task-based tile Cholesky algorithm that we implemented on top of this new runtime system layer. We show that it allows for very high productivity while achieving performance competitive with both the pure Message Passing Interface (MPI)-based ScaLAPACK Cholesky reference implementation and the DPLASMA Cholesky code, which implements another (non-sequential) task-based programming paradigm.
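As a rough illustration of the sequential task-based model the abstract describes, the sketch below submits the tasks of a tile Cholesky factorization in plain sequential order and infers their dependencies from declared access modes. It is not StarPU's API and not the authors' code: the function names `cholesky_tasks` and `infer_dependencies`, the `"R"`/`"RW"` mode strings, and the loop ordering are assumptions made for illustration only, and no numerical work or inter-node communication is performed.

```python
from collections import defaultdict


def cholesky_tasks(nt):
    """Yield (kernel, accesses) in sequential submission order.

    Hypothetical helper: `accesses` is a list of ((row, col), mode)
    pairs over an nt x nt grid of tiles, with mode "R" or "RW".
    """
    for k in range(nt):
        yield ("POTRF", [((k, k), "RW")])
        for m in range(k + 1, nt):
            yield ("TRSM", [((k, k), "R"), ((m, k), "RW")])
        for m in range(k + 1, nt):
            yield ("SYRK", [((m, k), "R"), ((m, m), "RW")])
            for n in range(k + 1, m):
                yield ("GEMM", [((m, k), "R"), ((n, k), "R"), ((m, n), "RW")])


def infer_dependencies(tasks):
    """Return, for each task index, the sorted indices it must wait for.

    Dependencies follow from submission order and access modes alone:
    a task waits for the last writer of every tile it touches, and a
    writer also waits for the readers registered since that last write.
    """
    last_writer = {}
    readers = defaultdict(list)
    deps = []
    for tid, (_kernel, accesses) in enumerate(tasks):
        waits = set()
        for tile, mode in accesses:
            if tile in last_writer:
                waits.add(last_writer[tile])  # read-after-write or write-after-write
            if mode == "RW":
                waits.update(readers[tile])   # write-after-read
                last_writer[tile] = tid
                readers[tile] = []
            else:
                readers[tile].append(tid)
        waits.discard(tid)
        deps.append(sorted(waits))
    return deps


if __name__ == "__main__":
    tasks = list(cholesky_tasks(3))
    for tid, ((kernel, _), waits) in enumerate(zip(tasks, infer_dependencies(tasks))):
        print(tid, kernel, "waits for", waits)
```

The point of the sketch is that the user-facing code is an ordinary loop nest; the parallelism comes entirely from the access annotations. Under a further assumption that each tile is assigned to a node, the same annotations would also indicate which tiles have to move between nodes, which is the kind of information a runtime could use to post communications automatically.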

Journal: IEEE Transactions on Parallel and Distributed Systems
Publication Date: Jun 16, 2016
Citations: 52

Similar Papers

Decentralized in-order execution of a sequential task-based code for shared-memory architectures
Charly Castes ... Emmanuel Agullo
01 May 2022

Performance Analysis of Sequential and Parallel Programming Paradigms on CPU-GPUs Cluster
B N Chandrashekhar ... H A Sanjay
04 Feb 2021

Revisiting the Sequential Programming Model for Multi-Core
Matthew Bridges ... Thomas Jablin
01 Dec 2007

Give MPI Threading a Fair Chance: A Study of Multithreaded MPI Designs
Thananon Patinyasakdikul ... David Eberius
01 Sep 2019
