Bridging the Architecture Gap: Abstracting Performance-Relevant Properties of Modern Server Processors

Jan Hofmann ,Christie L Alappat ,Dietmar Fey ,Georg Hager ,Gerhard Wellein

doi:10.14529/jsfi200204

Jan Hofmann , Christie L Alappat + Show 3 more

Open Access

https://doi.org/10.14529/jsfi200204

Copy DOI

Abstract

We describe a universal modeling approach for predicting single- and multicore runtime of steady-state loops on server processors. To this end we strictly differentiate between application and machine models: An application model comprises the loop code, problem sizes, and other runtime parameters, while a machine model is an abstraction of all performance-relevant properties of a CPU. We introduce a generic method for determining machine models and present results for relevant server-processor architectures by Intel, AMD, IBM, and Marvell/Cavium. Considering this wide range of architectures, the set of features required for adequate performance modeling is surprisingly small. To validate our approach, we compare performance predictions to empirical data for an OpenMP-parallel preconditioned CG algorithm, which includes compute- and memory-bound kernels. Both single- and multicore analysis shows that the model exhibits average and maximum relative errors of 5% and 10%. Deviations from the model and insights gained are discussed in detail.

Highlights

The architectural differences among processor models of different vendors lead to a diverse server-processor landscape in the high-performance computing market
We have shown that it is possible to set up a well-defined workflow for modeling the serial and parallel runtime of steady-state loops with regular data access patterns using the analytic ECM performance model
Four multicore server processors were investigated, and we could demonstrate that despite their obvious differences the main properties needed to set up a useful machine model can be summarized in a few parameters

Summary

Introduction

The architectural differences among processor models of different vendors (and even among models of a single vendor) lead to a diverse server-processor landscape in the high-performance computing market. In this work we introduce a structured method of establishing and describing those assumptions and parameters that best summarize the features of a multicore server processor. It has satisfactory predictive power in terms of performance modeling of (sequences of) steady-state loops with regular access patterns but is still simple enough to be carried out with pen and paper. As a consequence, reasoning about code performance from an architectural point of view becomes rooted in a scientific process

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Supercomputing Frontiers and Innovations	Publication Date: Jun 1, 2020
Citations: 8	License type: cc-by

R Discovery Prime

R Discovery Prime

Bridging the Architecture Gap: Abstracting Performance-Relevant Properties of Modern Server Processors

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Supercomputing Frontiers and Innovations

Lead the way for us

Similar Papers

Machine Learning Approach for Reservoir Petrophysical Properties Prediction from Well-Logs Data in the Niger Delta
Antigha Effiong Eyo ... Kilaliba Wanaemi Tugwell
-
Antigha Effiong Eyo, et. al.Antigha Effiong Eyo ... Kilaliba Wanaemi Tugwell
05 Aug 2024
05 Aug 2024

Optimization of Power Frequency Withstand Voltage Characteristics of Thermal Electrochemical Oxide Ceramic Film Based on Machine Learning
Zhen Yan ... Haomin Li
-
Zhen Yan, et. al.Zhen Yan ... Haomin Li
11 Apr 2021
11 Apr 2021

Development and testing of a grain combine harvester throughput monitoring system
Yawei Zhang ... Dong Dai
Computers and Electronics in Agriculture | VOL. 200
Yawei Zhang, et. al.Yawei Zhang ... Dong Dai
03 Aug 2022
Computers and Electronics in Agriculture | VOL. 200

Reconstruction of vertical thermal structure from several subsurface temperatures in the China Seas and adjacent waters
Jiajia Hao ... Yongli Chen
Chinese Journal of Oceanology and Limnology | VOL. 27
Jiajia Hao, et. al.Jiajia Hao ... Yongli Chen
01 May 2009
Chinese Journal of Oceanology and Limnology | VOL. 27

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bridging the Architecture Gap: Abstracting Performance-Relevant Properties of Modern Server Processors

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Supercomputing Frontiers and Innovations