Abstract

Smart cities and other cyber-physical systems (CPSs) rely on various scientific, engineering, business, and social applications that provide timely intelligence for their design, operations, and management. Many of these scientific and analytics applications require the solution of sparse linear equation systems, where sparse matrix-vector (SpMV) product is a key computing operation. Several factors determine the performance of parallel SpMV computations, including matrix characteristics, storage formats, and the rising complexity and heterogeneity of computer systems. There is a pressing need for new ways of exploiting parallelism, and mapping data and applications to the computing resources. We propose here ZAKI+, a data-driven machine-learning approach, allowing users to automatically, effortlessly, and speedily obtain the best configuration (the data distribution, the optimal number of processes, and mapping strategy) and performance for the execution of the parallel SpMV computations on distributed memory machines. We train and test the tool using three machine learning methods-decision trees, random forest, and Xtreme boosting-and nearly 2000 real-world matrices obtained from 45 application domains, including computer vision and robotics. ZAKI+ provides optimal process mapping and outperforms the MPI default mapping policy by a factor of 4.24. This is the first work where the sparsity structure of matrices has been exploited to predict the optimal mapping of processes and data in distributed-memory environments by using different base and ensemble machine learning methods. Various CPSs comprise compute-intensive machine learning applications, such as the SpMV, and hence, the process and data mapping contributions of this paper would be of paramount impact for the CPSs.

Highlights

  • Cyber-Physical Systems (CPS) comprises ‘‘interacting digital, analog, physical, and human components engineered for function through integrated physics and logic

  • AIM AND CONTRIBUTIONS In our earlier work, we have proposed ZAKI that predicts the optimal number of processes for sparse matrix-vector (SpMV) computations of an arbitrary sparse matrix on a distributed memory machine [63]

  • The execution times for the predicted optimal configuration of SpMV computations are compared with the average execution times of MPI default mapping policy; ZAKI+ provides 4.24 times aggregated speedup over the MPI default mapping policy with average parallel execution times

Read more

Summary

INTRODUCTION

Cyber-Physical Systems (CPS) comprises ‘‘interacting digital, analog, physical, and human components engineered for function through integrated physics and logic. The aim is to allow application scientists to automatically, effortlessly, and speedily obtain the best configuration (including the matrix/data distribution, optimal number of processes, and mapping strategy), and the best performance, for the execution of the SpMV computations for a given sparse matrix (see Figure 1). We: propose, implement, and evaluate a machine learning tool that allows users to automatically obtain the best configuration (including the optimal mapping of the processes and data), and the best performance, for SpMV computations of a given sparse matrix on a distributed-memory machine. To the best of our knowledge, this is the first work of its kind where the sparsity structure of matrices have been exploited to predict the optimal mapping of the processes and data in distributed memory environments by using different base and ensemble machine learning methods.

BACKGROUND
15: Repeat 5 to 13
FEATURES SCALING
EXPERIMENTAL PLATFORM
ANALYSIS OF MAPPING STRATEGIES FOR SPMV PARALLELIZATION
PERFORMANCE GAIN
Findings
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call