Abstract

Some high performance computing (HPC) applications exhibit increasing real-time requirements, which call for effective means to predict their high execution times distribution. This is a new challenge for HPC applications but a well-known problem for real-time embedded applications where solutions already exist, although they target low-performance systems running single-threaded applications. In this paper, we show how some performance validation and measurement-based practices for real-time execution time prediction can be leveraged in the context of HPC applications on high-performance platforms, thus enabling reliable means to obtain real-time guarantees for those applications. In particular, the proposed methodology uses coordinately techniques that randomly explore potential timing behavior of the application together with Extreme Value Theory (EVT) to predict rare (and high) execution times to, eventually, derive probabilistic Worst-Case Execution Time (pWCET) curves. We demonstrate the effectiveness of this approach for an acoustic wave inversion application used for geophysical exploration.

Highlights

  • High performance computing (HPC) systems have become ubiquitous and are no longer concentrated in supercomputing facilities and data centers

  • Contribution 3: Evaluation and scalability. We evaluate those techniques on the geophysical exploration application, proving their viability to study its execution time behavior, and showing that appropriate integration of those techniques allows scaling the application to the use of parallel paradigms, beyond the execution conditions considered in embedded systems

  • I.i.d. properties may not hold for HPC applications, but, as long as execution time is large enough, i.i.d holds for maxima, which is enough for a reliable application of measurement-based probabilistic timing analysis (MBPTA)-CV

Read more

Summary

Introduction

High performance computing (HPC) systems have become ubiquitous and are no longer concentrated in supercomputing facilities and data centers. The proliferation of HPC systems and applications in new domains has led to new requirements related to Quality of Service (QoS) and the implementation of platforms to satisfy them [1,2,3,4,5]. Mathematics 2020, 8, 314 modeling the propagation of hazardous substances after an accident, needs to be completed in a given short time frame in order to be useful

Motivation
Contribution
Execution Time Test Coverage Improvement for HPC
Memory-Placement Software Randomization
Code Randomization
Stack Randomization
Heap Randomization
Summary
Measurement-Based Probabilistic Timing Analysis for HPC
MBPTA-CV Fundamentals
MBPTA-CV Steps
Independence and Identical Distribution
Exponential Tail Test
Select the Best Tail
Case study
Experimental Framework
Results
Related Work
Conclusions
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.