Abstract

This paper surveys the state of the art in the design and implementation of data parallel scientific applications on heterogeneous platforms. It covers both traditional approaches, originally designed for clusters of heterogeneous workstations, and more recent methods developed for modern multicore and multi-accelerator heterogeneous platforms.

Highlights

  • High performance computing systems are becoming increasingly heterogeneous and hierarchical.

  • It was shown that even a static distribution based on simplistic performance models improves the performance of traditional dynamic scheduling techniques by up to 250% [44].

  • The constant performance model (CPM) can be a sufficiently accurate approximation of the performance of heterogeneous processors executing a data parallel application if: (i) the processors are general-purpose and execute the same code; and (ii) the local tasks are small enough to fit in the main memory but large enough not to fully fit in the processor cache (see the sketch after this list).
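
To illustrate the constant performance model referred to in the last highlight, the following minimal Python sketch shows how a single speed value per device might be obtained by timing the application's dominant kernel on one representative task. The names measure_constant_speed and run_kernel are assumptions made for this sketch, not an interface described in the paper.

    import time

    def measure_constant_speed(run_kernel, task_size):
        """Estimate a single constant speed value for one device by timing the
        application's dominant computational kernel on a representative task.

        The task size should be large enough that the data do not fit in the
        processor cache, yet small enough to fit in main memory, so that the
        measured speed changes little with task size.
        """
        start = time.perf_counter()
        run_kernel(task_size)           # hypothetical architecture-specific kernel
        elapsed = time.perf_counter() - start
        return task_size / elapsed      # speed in work units per second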

Summary

Introduction

High performance computing systems are becoming increasingly heterogeneous and hierarchical. It has been shown that even a static distribution based on simplistic performance models (single values specifying the maximum performance of a dominant computational kernel on CPUs and GPUs) improves the performance of traditional dynamic scheduling techniques by up to 250% [44]. In this overview we focus on parallel scientific applications, in which the computational workload is directly proportional to the size of the data, and on dedicated HPC platforms, where: (i) the performance of the application is stable over time and is not affected by varying system load; (ii) there is a significant overhead associated with data migration between computing devices; and (iii) optimized architecture-specific libraries implementing the same kernels may be available for different computing devices.
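
To make the idea of static distribution based on single speed values concrete, the following minimal Python sketch (an illustration under the assumptions above, not code from the surveyed papers) partitions n equal data elements among devices in proportion to their constant speeds, so that every device needs roughly the same time for its share.

    def cpm_partition(n, speeds):
        """Distribute n equal data elements among devices in proportion to
        their constant speeds, so that n_i / s_i is roughly the same for
        every device i."""
        total = sum(speeds)
        shares = [int(n * s / total) for s in speeds]   # proportional shares, rounded down
        leftover = n - sum(shares)                      # a few elements remain due to rounding
        # Give the leftover elements to the fastest devices, one each.
        for i in sorted(range(len(speeds)), key=lambda j: speeds[j], reverse=True)[:leftover]:
            shares[i] += 1
        return shares

    # Example: one accelerator assumed four times faster than each of two CPU cores.
    print(cpm_partition(1000, [4.0, 1.0, 1.0]))         # [667, 167, 166]

In the functional performance models covered later in the overview, the single speed value is replaced by a speed function of the problem size, so the best partition can no longer be computed from speed ratios alone.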

Data partitioning algorithms based on constant performance models
Data partitioning algorithms based on functional performance models
Implementation of heterogeneous data partitioning algorithms
Findings
Programming tools