Strategy for data-flow synchronizations in stencil parallel computations on multi-/manycore systems

Lukasz Szustak

doi:10.1007/s11227-018-2239-3

Abstract

In this paper, an innovative strategy for the data-flow synchronization in shared-memory systems is proposed. This strategy assumes to synchronize only interdependent threads instead of using the barrier approach that—in contrast to our approach—synchronize all threads. We demonstrate the adaptation of the data-flow synchronization strategy to two complex scientific applications based on stencil codes. An algorithm for the data-flow synchronization is developed and successfully used for both applications. The proposed approach is evaluated for various Intel microarchitectures released in the last 5 years, including the newest processors: Skylake and Knights Landing. The important part of this assessment is the performance comparison of the proposed data-flow synchronization with the OpenMP barrier. The experimental results show that the performance of the studied applications can be accelerated up to 1.3 times using the proposed data-flow synchronizations strategy.

Highlights

The huge capacity of modern HPC platforms allows complex problems, previously thought impossible, to be solved [12]
We propose an innovative strategy for the data-flow synchronization in shared-memory systems
Since only the adjacent threads depend on each other, we propose to perform the synchronization inside every couple of threads instead of using the barrier approach

Summary

Introduction

The huge capacity of modern HPC platforms allows complex problems, previously thought impossible, to be solved [12]. The EULAG model is an innovative solver in the field of numerical modeling of multiscale geophysical flows Another application area tackled in the work refers to the phase-field method, which is a powerful tool for solving interfacial problems in materials science [11]. The synchronization strategies that base on data-flow communication layers are very popular in distributed-memory programming standards, including MPI or hStreams programming library [5]. The synchronization between the interdependent processing elements is explicitly defined according to communication flows of data, using the specific commands such as MPI_Send and MPI_Recv in the case of MPI This is achieved on a totally different level of programming abstraction than in the proposed approach

Strategy for data-flow synchronization in stencils

Adaptation of data-flow synchronization strategy to MPDATA

Experimental results

Conclusions and future work

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: The Journal of Supercomputing	Publication Date: Jan 6, 2018
Citations: 10	License type: open-access

R Discovery Prime

R Discovery Prime

Strategy for data-flow synchronizations in stencil parallel computations on multi-/manycore systems

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: The Journal of Supercomputing

Lead the way for us

Similar Papers

Shared buffer implementations of signal processing systems using lifetime analysis techniques
P.K Murthy ... S.S Bhattacharyya
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | VOL. 20
P.K Murthy, et. al.P.K Murthy ... S.S Bhattacharyya
01 Jan 2001
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | VOL. 20

Static Scheduling of Synchronous Data Flow onto Multiprocessors for Embedded DSP Systems
Guoxin Liu ... Liang Guo
-
Guoxin Liu, et. al.Guoxin Liu ... Liang Guo
01 Jan 2010
01 Jan 2010

Resource Optimization for Real-Time Streaming Applications Using Task Replication
Sobhan Niknam ... Todor Stefanov
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | VOL. 37
Sobhan Niknam, et. al.Sobhan Niknam ... Todor Stefanov
01 Nov 2018
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | VOL. 37

On GPU optimizations of stencil codes for highly parallel simulations
Nikolai Pfisterer ... Marco Berghoff
-
Nikolai Pfisterer, et. al.Nikolai Pfisterer ... Marco Berghoff
01 Mar 2021
01 Mar 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Strategy for data-flow synchronizations in stencil parallel computations on multi-/manycore systems

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: The Journal of Supercomputing