Design methodology for workload‐aware loop scheduling strategies based on genetic algorithm and simulation

Pedro H Penna, Henrique C Freitas, François Broquedis, Jean‐François Méhaut, Márcio Castro

doi:10.1002/cpe.3933

Abstract

In high‐performance computing, an application's workload must be evenly balanced among threads to deliver cutting‐edge performance and scalability. In OpenMP, the load balancing problem arises when scheduling loop iterations to threads. Several scheduling strategies have been proposed in this context, but they do not take into account the input workload of the application and thus turn out to be suboptimal. In this work, we introduce a design methodology to propose, study, and assess the performance of workload‐aware loop scheduling strategies. In this methodology, a genetic algorithm is employed to explore the solution space of the problem and to guide the design of new loop scheduling strategies, and a simulator is used to evaluate their performance. As a proof of concept, we show how the methodology was used to propose and study a new workload‐aware loop scheduling strategy named smart round‐robin (SRR). We implemented this strategy in the OpenMP runtime of the GNU Compiler Collection and carried out several experiments to validate the simulator and to evaluate the performance of SRR. Our experimental results show that SRR may deliver up to 37.89% better performance than OpenMP's dynamic loop scheduling strategy in the simulated environment and up to 14.10% better performance in a real‐world application kernel. Copyright © 2016 John Wiley & Sons, Ltd.
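The load balancing problem described in the abstract can be illustrated with a small simulation. The sketch below is not the SRR strategy or the simulator from the paper, whose details are not given here. It assumes per-iteration costs are known in advance and compares a contiguous block split (similar in spirit to OpenMP's static schedule) with a greedy longest-processing-time-first assignment, used only as a stand-in for a workload-aware strategy. All function names and the example workload are hypothetical.

```python
from typing import List


def static_schedule(workload: List[int], n_threads: int) -> List[int]:
    """Per-thread load when iterations are split into contiguous blocks,
    similar in spirit to OpenMP's schedule(static) with no chunk size."""
    n = len(workload)
    loads = [0] * n_threads
    for t in range(n_threads):
        # Thread t gets iterations [start, end); block sizes differ by at most one.
        start = t * n // n_threads
        end = (t + 1) * n // n_threads
        loads[t] = sum(workload[start:end])
    return loads


def workload_aware_schedule(workload: List[int], n_threads: int) -> List[int]:
    """Greedy longest-processing-time-first assignment (an assumption for
    illustration, NOT the paper's SRR): take the heaviest iterations first
    and give each one to the currently least-loaded thread."""
    loads = [0] * n_threads
    for cost in sorted(workload, reverse=True):
        lightest = loads.index(min(loads))
        loads[lightest] += cost
    return loads


def makespan(loads: List[int]) -> int:
    """The loop finishes when the most heavily loaded thread finishes."""
    return max(loads)


# Hypothetical irregular workload: cost of each loop iteration.
workload = [1, 1, 1, 1, 8, 8, 8, 8]
static_loads = static_schedule(workload, 4)
aware_loads = workload_aware_schedule(workload, 4)

print("static:", static_loads, "makespan:", makespan(static_loads))
print("workload-aware:", aware_loads, "makespan:", makespan(aware_loads))
```

With this skewed input, the block split leaves two threads nearly idle while the other two carry most of the work, whereas the workload-aware assignment spreads the heavy iterations across all threads. This is the kind of gap that motivates taking the input workload into account.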

Journal: Concurrency and Computation: Practice and Experience
Publication Date: Aug 12, 2016
Citations: 6
