Performance prediction techniques for scalable large data processing in distributed MPI systems

Janki Bhimani,Miriam Leeser,Ningfang Mi

doi:10.1109/pccc.2016.7820608

Janki Bhimani, Miriam Leeser + Show 1 more

https://doi.org/10.1109/pccc.2016.7820608

Copy DOI

Export

Save

Cite

Publication Date: Dec 1, 2016

Citations: 5

Affiliation: Universidad del Noreste

Abstract
Full-Text
Similar Papers

Abstract

Listen

Predicting performance of an application running on parallel computing platforms is increasingly becoming important due to the long development time of an application and the high resource management cost of parallel computing platforms. However, predicting overall performance is complex and must take into account both parallel calculation time and communication time. Difficulty in accurate performance modeling is compounded by myriad design choices along multiple dimensions, namely (i) process level parallelism, (ii) distribution of cores on multi-processor platforms, (iii) application related parameters, and (iv) characteristics of datasets. This research proposes a fast and accurate performance prediction approach to predict the calculation and communication time of an application running on a distributed computing platform. The major contribution of our prediction approach is that it can provide an accurate prediction of execution times for new datasets which have much larger sizes than the training datasets. Our approach consists of two models, i.e., a probabilistic self-learning model to predict calculation time and a simulation queuing model to predict network communication time. The combination of these two models provides data analysts a useful insight of optimal configuration of parallel resources (e.g., number of processes and number of cores) and application parameters setting.

Full Text