Dynamic Model Evaluation to Accelerate Distributed Machine Learning

Simon Caton,Kostas Katrinis,Shashi Bhushan Tn,Vidya Sankar Velamuri,Srikumar Venugopal

doi:10.1109/bigdatacongress.2018.00027

Abstract

The increase in the volume and variety of data has increased the reliance of data scientists on shared computational resources, either in-house or obtained via cloud providers, to execute machine learning and artificial intelligence programs. This, in turn, has created challenges of exploiting available resources to execute such cognitive workloads quickly and effectively to gather the needed knowledge and data insight. A common challenge in machine learning is knowing when to stop model building. This is often exacerbated in the presence of big data as a trade off between the cost of producing the model (time, volume of training data, resources utilised) and its general performance. Whilst there are many tools and application stacks available to train models over distributed resources, the challenge of knowing when a model is good enough or no longer worth pursuing persists. In this paper, we propose a framework for the evaluating the models produced by distributed machine learning algorithms during the training process. This framework integrates with the cluster job scheduler so as to finalise model training under constraints of resource availability or time, or simply because model performance is asymptotic with further training. We present a prototype implementation of this framework using Apache Spark and YARN, and demonstrate the benefits of this approach using sample applications with both supervised and unsupervised learning algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Dynamic Model Evaluation to Accelerate Distributed Machine Learning

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Artificial intelligence: Friend or foe?
Anusch Yazdani ... Sam Costa
Australian and New Zealand Journal of Obstetrics and Gynaecology | VOL. 63
Anusch Yazdani, et. al.Anusch Yazdani ... Sam Costa
01 Apr 2023
Australian and New Zealand Journal of Obstetrics and Gynaecology | VOL. 63

SLC: A Permissioned Blockchain for Secure Distributed Machine Learning against Byzantine Attacks
Lun Liang ... Jun Zhang
-
Lun Liang, et. al.Lun Liang ... Jun Zhang
06 Nov 2020
06 Nov 2020

Performance Evaluation of Machine Learning Models on Apache Spark: An Empirical Study
Asma Z Yamani ... Imane Boudellioua
-
Asma Z Yamani, et. al.Asma Z Yamani ... Imane Boudellioua
04 Dec 2022
04 Dec 2022

IoT and distributed machine learning powered optimal state recommender solution
Mohit Sewak ... Sachchidanand Singh
-
Mohit Sewak, et. al.Mohit Sewak ... Sachchidanand Singh
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dynamic Model Evaluation to Accelerate Distributed Machine Learning

Abstract

Talk to us

Similar Papers