Abstract

The predictive performance of a machine learning model depends strongly on the corresponding hyper-parameter setting; hence, hyper-parameter tuning is often indispensable. Normally such tuning requires the dedicated machine learning model to be trained and evaluated on centralized data to obtain a performance estimate. However, in a distributed machine learning scenario, it is not always possible to collect all the data from all nodes due to privacy concerns or storage limitations. Moreover, transferring data through low-bandwidth connections reduces the time available for tuning. Model-Based Optimization (MBO) is a state-of-the-art method for tuning hyper-parameters, but its application to distributed machine learning models or federated learning has received little research attention. This work proposes a framework, MODES, that allows MBO to be deployed on resource-constrained distributed embedded systems. Each node trains an individual model based on its local data, and the goal is to optimize the combined prediction accuracy. The presented framework offers two optimization modes: (1) MODES-B treats the whole ensemble as a single black box and optimizes the hyper-parameters of each individual model jointly, and (2) MODES-I treats all models as clones of the same black box, which allows the optimization to be parallelized efficiently in a distributed setting. We evaluate MODES by optimizing the hyper-parameters of a random forest and a multi-layer perceptron. The experimental results demonstrate that MODES outperforms the baseline, i.e., tuning with MBO on each node individually on its local sub-data set, with improvements in mean accuracy (MODES-B), run-time efficiency (MODES-I), and statistical stability for both modes.
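To make the difference between the two modes concrete, the following Python sketch contrasts the two objective definitions. It is a minimal illustration under assumptions introduced here, not the authors' implementation: the simulated node shards (NODE_DATA), the majority-vote combiner, the shared held-out test set, and the random-search stand-in for the MBO loop are all illustrative choices; only the distinction between a jointly tuned, concatenated hyper-parameter vector (MODES-B) and a single shared configuration evaluated by every node (MODES-I) follows the abstract.

```python
# Illustrative sketch only; names and data handling are assumptions, not the
# authors' code. MODES replaces the random-search loop below with MBO.
import numpy as np
from sklearn.datasets import load_digits
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

K = 3  # number of nodes (illustrative)
X, y = load_digits(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)
# Simulate the distributed setting: each node holds a private shard.
NODE_DATA = list(zip(np.array_split(X_tr, K), np.array_split(y_tr, K)))

def train_local(node, n_estimators, max_depth):
    Xn, yn = NODE_DATA[node]
    return RandomForestClassifier(n_estimators=n_estimators,
                                  max_depth=max_depth,
                                  random_state=0).fit(Xn, yn)

def ensemble_accuracy(models):
    # Combined prediction via majority vote (one plausible combiner).
    votes = np.stack([m.predict(X_te) for m in models])
    maj = np.apply_along_axis(lambda v: np.bincount(v).argmax(), 0, votes)
    return float((maj == y_te).mean())

def evaluate_modes_b(theta):
    # MODES-B: the whole ensemble is one black box; theta concatenates the
    # hyper-parameters of all K local models and is optimized jointly.
    models = [train_local(k, int(theta[2 * k]), int(theta[2 * k + 1]))
              for k in range(K)]
    return ensemble_accuracy(models)

def evaluate_modes_i(theta):
    # MODES-I: all models are clones of the same black box; every node
    # evaluates the same configuration on its local data (parallelizable).
    models = [train_local(k, int(theta[0]), int(theta[1])) for k in range(K)]
    return ensemble_accuracy(models)

# Stand-in optimizer: random search over each objective (MODES uses MBO).
rng = np.random.default_rng(0)
best_b = max(evaluate_modes_b(rng.integers([10, 2] * K, [200, 20] * K))
             for _ in range(10))
best_i = max(evaluate_modes_i(rng.integers([10, 2], [200, 20]))
             for _ in range(10))
print(f"MODES-B-style best ensemble accuracy: {best_b:.3f}")
print(f"MODES-I-style best ensemble accuracy: {best_i:.3f}")
```

The design point the sketch highlights: MODES-B searches a space whose dimensionality grows with the number of nodes, whereas MODES-I keeps a single fixed-size search space and only distributes the evaluations, which is what makes it easy to parallelize.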

Highlights

  • Nowadays, statistical and machine learning algorithms are used ever more frequently and intensively to solve problems in a wide range of applications, e.g., smart homes, medical diagnosis, and environmental analysis

  • We evaluated all combinations and report the classification accuracy for two machine learning algorithms and three data-splitting strategies, separately for the different data sets

  • These results show that B-EI outperforms all other methods in most of the evaluated cases w.r.t. mean prediction accuracy and/or statistical stability


Introduction

Statistical and machine learning algorithms are used ever more frequently and intensively to solve problems in a wide range of applications, e.g., smart homes, medical diagnosis, and environmental analysis. The most direct and easiest-to-implement tuning algorithm is grid search (LeCun et al 2012), which discretizes the hyper-parameter search space and exhaustively evaluates all possible combinations in a Cartesian grid to find the setting with the best performance. A common alternative is random search (Bergstra and Bengio 2012), which randomly samples hyper-parameter settings from the search space. Bayesian neural networks (BNNs) can serve as surrogate models for MBO, and several extensions have been proposed to speed them up, e.g., sampling multiple sub-networks from a network trained with Dropout (Srivastava et al 2014; Gal and Ghahramani 2016)
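
The two baseline tuners mentioned above can be sketched with scikit-learn's built-in utilities. The data set, model, and parameter ranges below are illustrative assumptions and not taken from the paper.

```python
# Illustrative sketch of grid search vs. random search; the data set, model,
# and parameter ranges are assumptions, not those used in the paper.
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV, RandomizedSearchCV
from sklearn.neural_network import MLPClassifier

X, y = load_iris(return_X_y=True)
params = {"hidden_layer_sizes": [(16,), (32,), (64,)],
          "alpha": [1e-4, 1e-3, 1e-2]}

# Grid search: exhaustively evaluates every combination in the Cartesian grid.
grid = GridSearchCV(MLPClassifier(max_iter=2000, random_state=0),
                    params, cv=3).fit(X, y)

# Random search: samples a fixed budget of settings from the same space.
rand = RandomizedSearchCV(MLPClassifier(max_iter=2000, random_state=0),
                          params, n_iter=5, cv=3, random_state=0).fit(X, y)

print("grid search best:  ", grid.best_params_, round(grid.best_score_, 3))
print("random search best:", rand.best_params_, round(rand.best_score_, 3))
```

In contrast to these budget- or grid-driven baselines, MBO uses a surrogate model of the objective to decide which hyper-parameter setting to evaluate next, which is why it is attractive when each evaluation is expensive, as on resource-constrained embedded nodes.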
