Abstract
The complex memory hierarchies of today's machines make it difficult to estimate the execution time of tasks: depending on where their data is placed in memory, tasks of the same type may end up with very different performance. Several scheduling heuristics have improved performance by taking memory-related properties such as data locality and cache sharing into account. However, tasks in certain applications, or in certain phases of an application, may take little or no advantage of these optimizations, and without understanding when such optimizations are effective we risk introducing unnecessary overhead at the runtime level. In previous work, we introduced TaskInsight, a technique to characterize how the memory behavior of an application is affected by different task schedulers through the analysis of data reuse across tasks. We now use this tool to dynamically trace the scheduling decisions of multithreaded applications through their execution and analyze how memory reuse can reveal when and why locality-aware optimizations are effective and how they impact performance. By applying TaskInsight to several of the Montblanc benchmarks, we demonstrate how we can detect the particular scheduling decisions that produced a variation in performance, and the underlying reasons. This flexible insight is key for both the programmer and the runtime to assign the optimal scheduling policy to particular executions or phases.
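
As a rough illustration of the kind of cross-task data-reuse analysis described above (a minimal sketch, not TaskInsight's actual implementation), the Python fragment below classifies each task's accesses as first touches, "fresh" reuses of data touched by a recently scheduled task, or "old" reuses of data touched much earlier and therefore likely evicted. The trace format, the window parameter and the name classify_reuse are assumptions made for illustration only.

    from collections import defaultdict

    def classify_reuse(task_trace, window):
        """Classify each task's accesses as 'new' (first touch),
        'fresh' (last touched within `window` preceding tasks, likely
        still cached) or 'old' (last touched longer ago, likely evicted).

        task_trace: (task_index, cache_line) pairs in the order the
        scheduler executed the tasks; `window` approximates how many
        tasks' working sets the shared cache can hold."""
        last_use = {}                     # cache_line -> index of last touching task
        stats = defaultdict(lambda: {"new": 0, "fresh": 0, "old": 0})
        for task, line in task_trace:
            prev = last_use.get(line)
            if prev is None:
                stats[task]["new"] += 1
            elif task - prev <= window:
                stats[task]["fresh"] += 1
            else:
                stats[task]["old"] += 1
            last_use[line] = task
        return stats

    # Toy example: task 2 reuses a line last touched by task 0.
    trace = [(0, 0x10), (0, 0x18), (1, 0x20), (2, 0x10)]
    print({t: dict(s) for t, s in classify_reuse(trace, window=1).items()})

Under a locality-aware schedule a larger share of accesses should fall into the "fresh" bucket; this is the intuition behind relating scheduling decisions to memory reuse.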
Highlights
Scheduling tasks in task-based applications has become significantly more difficult due to overall system complexity and deep shared memory hierarchies
The graph shows the percentage of the population as a function of slowdown, summarizing how many of the experiments have a slowdown larger than X%. Benchmarks such as fft, cholesky, reduction and n-body show high performance variation across a significant number of their configurations: 40% of the executions of fft show more than 60% performance difference when changing the scheduling policy; for reduction, 30% of the executions show over 30% performance difference; and for cholesky, 40% show differences of over 30%
By combining schedule-independent memory access profiling and schedule-specific hardware performance counter data we are able to identify which scheduling decisions impact performance, when they happen, and why they cause a problem (see the sketch after these highlights)
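
The combination named in the last highlight can be made concrete with a small sketch that joins a schedule-independent reuse profile (for example, the per-task "fresh"/"old" counts from the earlier sketch) with schedule-specific hardware counters. The per-task counter fields (cache_accesses, cache_misses), the thresholds and the name flag_suspect_tasks are hypothetical; the actual analysis relies on the platform's real performance counters.

    def flag_suspect_tasks(reuse_stats, counters,
                           old_fraction_threshold=0.5, miss_rate_threshold=0.05):
        """Report tasks where a large share of 'old' reuse (from the
        schedule-independent profile) coincides with a high measured miss
        rate (from schedule-specific counters)."""
        suspects = []
        for task, s in reuse_stats.items():
            reused = s["fresh"] + s["old"]
            if reused == 0 or task not in counters:
                continue
            old_fraction = s["old"] / reused
            c = counters[task]
            miss_rate = c["cache_misses"] / max(c["cache_accesses"], 1)
            if old_fraction >= old_fraction_threshold and miss_rate >= miss_rate_threshold:
                suspects.append((task, old_fraction, miss_rate))
        return suspects

    # Hypothetical inputs: per-task reuse counts and per-task counter samples.
    reuse = {2: {"new": 0, "fresh": 1, "old": 9}}
    counters = {2: {"cache_accesses": 1000, "cache_misses": 120}}
    print(flag_suspect_tasks(reuse, counters))   # -> [(2, 0.9, 0.12)]

Tasks flagged this way point to the scheduling decisions where a different execution order separated a task from the data it reuses, which is when locality-aware policies pay off.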
Summary
Scheduling tasks in task-based applications has become significantly more difficult due to overall system complexity and deep shared memory hierarchies. Developers of task-based applications often blame performance degradation on data locality and attempt to characterize their workload based on data reuse without considering the dynamic interaction between the scheduler and the caches [3, 10]. This is because there has been no way to obtain precise information on how data was reused throughout the execution of the application, such as how long it remained in the caches and how the scheduling decisions influenced this reuse. We show how applying TaskInsight to the widely adopted Montblanc benchmarks reveals deep insight into why scheduling changed the memory behavior of applications, which is key to understanding performance variation across different executions. We cover related previous work (Section 3) and conclude with remarks on how the TaskInsight analysis enables us to understand other behaviors across the benchmarks and schedulers (Conclusion)