Low learning-cost offline strategies for EDP optimization of parallel applications

Gustavo Paim Berned, Fábio D Rossi, Antonio Carlos S Beck, Samuel Xavier De Souza, Arthur F Lorenzon, Marcelo C Luizelli

doi:10.1016/j.sysarc.2020.101959

Abstract

Many parallel applications do not scale with the number of threads. Several online and offline strategies have been proposed to optimize this number. While online strategies can capture behaviors that are only known at runtime, offline strategies impose no execution overhead and can use more complex and efficient algorithms. However, the learning algorithm in these offline strategies may take several hours, which precludes their use or hinders portability across different systems. In this scenario, we propose a methodology that decreases the learning time of offline strategies by inferring the execution behavior of parallel applications from smaller input sets than the ones used by the target applications. It implements two search strategies: SEA, where all parallel regions of an application run with the same number of threads; and SPRA, which seeks an ideal number of threads for each parallel region of a given application. With an extensive set of experiments, we show that the SEA and SPRA strategies converge to results close to those of an offline approach applied over the regular input, while being 88% and 87% faster on average, respectively. We also show that SPRA is better than SEA for unbalanced applications.
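The difference between the two strategies can be illustrated with a minimal sketch. It assumes that EDP is the usual energy-delay product (energy multiplied by execution time) and that the search is a plain exhaustive scan over candidate thread counts; the abstract does not state which search algorithm the authors use, so this is not their method. The function names, the `measure` callback, and the toy numbers are all hypothetical. The per-region search also simplifies matters by minimizing each region's EDP independently, which does not guarantee the lowest whole-application EDP.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical callback: (region, threads) -> (energy in joules, time in seconds),
# assumed to be measured on a small input set during the learning phase.
Measure = Callable[[str, int], Tuple[float, float]]


def edp(energy: float, time: float) -> float:
    """Energy-delay product: lower is better."""
    return energy * time


def app_edp(regions: List[str], threads: Dict[str, int], measure: Measure) -> float:
    """EDP of the whole application for a given per-region thread assignment."""
    total_energy = sum(measure(r, threads[r])[0] for r in regions)
    total_time = sum(measure(r, threads[r])[1] for r in regions)
    return edp(total_energy, total_time)


def sea(regions: List[str], candidates: List[int], measure: Measure) -> Dict[str, int]:
    """SEA-style search: one thread count shared by every parallel region."""
    best = min(candidates,
               key=lambda t: app_edp(regions, {r: t for r in regions}, measure))
    return {r: best for r in regions}


def spra(regions: List[str], candidates: List[int], measure: Measure) -> Dict[str, int]:
    """SPRA-style search: a separate thread count for each parallel region.

    Simplification (assumption): each region is optimized independently.
    """
    return {r: min(candidates, key=lambda t: edp(*measure(r, t))) for r in regions}


# Toy, made-up measurements for an unbalanced application:
# one region scales well, the other barely benefits from more threads.
TOY = {
    ("scalable", 1): (100.0, 10.0),
    ("scalable", 2): (110.0, 5.5),
    ("scalable", 4): (130.0, 3.0),
    ("serial_heavy", 1): (50.0, 5.0),
    ("serial_heavy", 2): (70.0, 4.5),
    ("serial_heavy", 4): (110.0, 4.4),
}


def toy_measure(region: str, threads: int) -> Tuple[float, float]:
    return TOY[(region, threads)]


REGIONS = ["scalable", "serial_heavy"]
CANDIDATES = [1, 2, 4]

sea_choice = sea(REGIONS, CANDIDATES, toy_measure)
spra_choice = spra(REGIONS, CANDIDATES, toy_measure)
```

On this toy data the shared setting has to compromise between the two regions, whereas the per-region setting keeps the poorly scaling region at a low thread count. That is the kind of unbalanced case in which the abstract reports SPRA outperforming SEA.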

Journal: Journal of Systems Architecture
Publication Date: Dec 2, 2020
Citations: 4

Similar Papers

Decreasing the Learning Cost of Offline Parallel Application Optimization Strategies
Gustavo Berned ... Fabio D Rossi
01 Mar 2020

Dynamic concurrency throttling on NUMA systems and data migration impacts
Janaina Schwarzrock ... Arthur F Lorenzon
Design Automation for Embedded Systems | VOL. 25
04 Nov 2020

Performance Model for Master/Worker Hybrid Applications on Multicore Clusters
Abel Castellanos ... Tomas Margalef
01 Nov 2013

Design and Implementation of Hybrid and Native Communication Devices for Java HPC
Bibrak Qamar ... Bryan Carpenter
Procedia Computer Science | VOL. 29
01 Jan 2014
