A Minimally Intrusive Approach for Automatic Assessment of Parallel Performance Scalability of Shared-Memory HPC Applications

Vitor Ramos Gomes Da Silva, Anderson Bráulio Nóbrega Da Silva, Pierre Manneback, Samuel Xavier-De-Souza, Carlos Valderrama

doi:10.3390/electronics11050689

Abstract

High-performance computing systems have become increasingly dynamic, complex, and unpredictable. To help build software that uses full-system capabilities, performance measurement and analysis tools rely on extensive execution analysis focused on single-run results. Although effective in identifying performance hotspots and bottlenecks, these tools are not well suited to evaluating the overall scalability trends of parallel applications. They either lack support for combining data from multiple runs or collect excessive data, causing unnecessary overhead. In this work, we present a tool for automatically measuring and comparing several executions of a parallel application across scenarios characterized by the input arrangements, the number of threads, the number of cores, and frequencies. Unlike existing performance analysis tools, the proposed tool fills some gaps in the specialized features needed to better understand the scalability trends of computational resources across configurations. To improve scalability analysis and productivity over the vast spectrum of possible configurations, the tool features automatic instrumentation, direct mapping of parallel regions, accuracy-preserving data reductions, and ease of use. Because it aims to capture the scalability trends of parallel applications accurately, intrusion is kept minimal: detailed single-run performance analyses show less than 1% overhead.

Highlights

The code optimization step is a fundamental part of the software construction strategy and is supported by performance measurement and analysis tools [2,3,4,5,6,7,8,9,10,11].
We offer an alternative tool that performs parallel scalability analysis more efficiently than single-run-centric performance measurement and analysis tools.
We used three experiments to assess the tool and demonstrate its ability to support analysis aimed at observing parallel scalability.

Summary

Introduction

Due to the complexity of parallel systems, correctly identifying and locating performance and scalability bottlenecks depends on the developer’s ability to compare several measurements in different execution configurations [4,13].
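The kind of cross-configuration comparison described above can be sketched in a few lines. This is only an illustration of the idea, not the tool presented in the paper: the `Run` record, the `scalability_table` helper, and the timings are hypothetical, and the metrics used are the standard definitions of speedup (baseline time divided by measured time) and parallel efficiency (speedup divided by the increase in thread count).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Run:
    """One measured execution (hypothetical record, not the paper's format)."""
    threads: int
    input_size: int
    seconds: float


def scalability_table(runs):
    """Group runs by input size and compute (threads, speedup, efficiency).

    The run with the fewest threads in each group is used as the baseline.
    """
    by_input = {}
    for run in runs:
        by_input.setdefault(run.input_size, []).append(run)

    table = {}
    for size, group in by_input.items():
        group = sorted(group, key=lambda r: r.threads)
        base = group[0]
        rows = []
        for run in group:
            speedup = base.seconds / run.seconds
            # Efficiency: speedup relative to the growth in thread count.
            efficiency = speedup * base.threads / run.threads
            rows.append((run.threads, speedup, efficiency))
        table[size] = rows
    return table


if __name__ == "__main__":
    # Made-up timings, for illustration only.
    measurements = [
        Run(threads=1, input_size=1000, seconds=8.0),
        Run(threads=2, input_size=1000, seconds=4.0),
        Run(threads=4, input_size=1000, seconds=2.5),
    ]
    for size, rows in scalability_table(measurements).items():
        for threads, speedup, efficiency in rows:
            print(f"input={size} threads={threads} "
                  f"speedup={speedup:.2f} efficiency={efficiency:.2f}")
```

Efficiency that falls as threads are added, as in the made-up four-thread run here, is the sort of trend that only becomes visible when several runs are placed side by side.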

Journal: Electronics
Publication Date: Feb 23, 2022
License type: CC BY 4.0
