MPEC: Distributed Matrix Multiplication Performance Modeling on a Scale-Out Cloud Environment for Data Mining Jobs

Jeongchul Kim, Kyungyong Lee, Myungjun Son

doi:10.1109/tcc.2019.2950400

Abstract

Many data mining workloads are analyzed in large-scale distributed cloud computing environments, which provide nearly infinite resources with diverse hardware configurations. To maintain cost-efficiency in such environments, it is essential to understand the characteristics and estimate the overheads of distributed matrix multiplication, a core computation kernel in many machine learning algorithms. This study proposes the Matrix Multiplication Performance Estimator on Cloud (MPEC) algorithm. The proposed algorithm predicts the latency incurred when executing distributed matrix multiplication tasks of various input sizes and shapes, with diverse instance types and different numbers of worker nodes, in cloud computing environments. To achieve this goal, we first analyze the characteristics of distributed matrix multiplication tasks. With the characteristics generated from this qualitative analysis, we propose applying an ensemble of non-linear regression algorithms to predict the execution time of arbitrary matrix multiplication tasks. Thorough experimental results reveal that the proposed algorithm achieves higher accuracy than Ernest, a state-of-the-art machine learning task performance estimation engine, cutting the Mean Absolute Percentage Error (MAPE) in half.
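The accuracy metric named in the abstract, MAPE, is the average of the absolute prediction errors, each taken relative to the measured value. The following is a minimal sketch of that standard definition, not the authors' evaluation code. The function name `mape` and all latency values are hypothetical, chosen only so that one set of predictions has half the error of the other.

```python
def mape(actual, predicted):
    """Return the Mean Absolute Percentage Error, in percent."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    if not actual:
        raise ValueError("at least one observation is required")
    total = 0.0
    for a, p in zip(actual, predicted):
        if a == 0:
            # The relative error is undefined for a zero measurement.
            raise ValueError("actual values must be non-zero")
        total += abs((a - p) / a)
    return 100.0 * total / len(actual)


# Hypothetical latencies in seconds (illustrative only, not from the paper).
measured = [10.0, 20.0, 40.0]
baseline = [12.0, 16.0, 48.0]  # each prediction is off by 20%
improved = [11.0, 18.0, 44.0]  # each prediction is off by 10%

print(f"baseline MAPE: {mape(measured, baseline):.1f}%")
print(f"improved MAPE: {mape(measured, improved):.1f}%")
```

Because every error is divided by the measured latency, short and long jobs contribute equally to the score, which suits a comparison across matrix multiplication tasks of very different sizes.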

Journal: IEEE Transactions on Cloud Computing
Publication Date: Jan 1, 2022
Citations: 11

Similar Papers

Wind speed prediction in some major cities in Africa using Linear Regression and Random Forest algorithms
Timothy Kayode Samson ... Francis Olatunbosun Aweda
Journal of the Nigerian Society of Physical Sciences | VOL. -
08 Sep 2024

Foretelling the compressive strength of bamboo using machine learning techniques
Saurabh Dubey ... Mainak Mallik
Engineering Computations | VOL. 41
30 Sep 2024

Consultation length and no-show prediction for improving appointment scheduling efficiency at a cardiology clinic: A data analytics approach
Sharan Srinivas ... Haya Salah
International Journal of Medical Informatics | VOL. 145
01 Oct 2020

Use of machine learning approaches for body weight prediction in Peruvian Corriedale Sheep
Ali William Canaza-Cayo ... Yhan Carlos Rojas De La Cruz
Smart Agricultural Technology | VOL. 7
20 Feb 2024
