Abstract
In recent years, distributed systems have been widely used to train machine learning (ML) models. However, because the computational nodes in a distributed cluster differ in performance and network transmission introduces delays, the accuracy and convergence rate of ML models suffer. It is therefore necessary to design a strategy that dynamically optimizes communication to improve cluster utilization, accelerate training, and preserve model accuracy. In this paper, we propose an adaptive synchronous parallel strategy for distributed ML. A performance monitoring model adaptively adjusts how each computational node synchronizes with the parameter server according to that node's observed performance, thereby ensuring higher accuracy. Furthermore, the strategy prevents the ML model from being affected by unrelated tasks running in the same cluster. Experiments show that our strategy improves cluster utilization, maintains the accuracy and convergence speed of the model, increases training speed, and scales well.
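To make the idea concrete, the following is a minimal sketch of how a parameter server might adapt each worker's synchronization based on monitored performance. It is an illustration under stated assumptions, not the paper's implementation: names such as AdaptiveSyncServer, base_staleness, and the staleness-scaling rule are hypothetical, and the "model" is a toy scalar.

```python
import time
from collections import defaultdict

# Hypothetical sketch (not the paper's implementation): a parameter server
# that widens or tightens each worker's staleness bound according to its
# observed iteration time relative to the cluster average.
class AdaptiveSyncServer:
    def __init__(self, num_workers, base_staleness=3):
        self.num_workers = num_workers
        self.base_staleness = base_staleness
        self.clock = defaultdict(int)        # logical clock (iteration count) per worker
        self.iter_times = defaultdict(list)  # recent iteration durations per worker
        self.params = 0.0                    # toy scalar "model" parameter

    def report_iteration(self, worker_id, duration):
        """Workers report how long their last iteration took (performance monitoring)."""
        times = self.iter_times[worker_id]
        times.append(duration)
        if len(times) > 10:                  # keep a sliding window of recent timings
            times.pop(0)

    def allowed_staleness(self, worker_id):
        """Slower workers get a larger staleness bound so fast workers need not block on them."""
        avg = lambda xs: sum(xs) / len(xs) if xs else 1.0
        cluster_avg = avg([t for ts in self.iter_times.values() for t in ts])
        worker_avg = avg(self.iter_times[worker_id])
        ratio = worker_avg / cluster_avg if cluster_avg > 0 else 1.0
        return max(1, round(self.base_staleness * ratio))

    def push(self, worker_id, gradient):
        """Apply a worker's update, then advance its logical clock."""
        self.params -= 0.01 * gradient       # fixed toy learning rate
        self.clock[worker_id] += 1

    def can_proceed(self, worker_id):
        """A worker may continue only if it is not too far ahead of the slowest worker."""
        slowest = min(self.clock[w] for w in range(self.num_workers))
        return self.clock[worker_id] - slowest <= self.allowed_staleness(worker_id)


# Minimal usage: two workers, one deliberately slower, sharing one server.
server = AdaptiveSyncServer(num_workers=2)
for step in range(5):
    for wid, delay in [(0, 0.01), (1, 0.03)]:
        start = time.time()
        time.sleep(delay)                    # simulate local computation
        server.report_iteration(wid, time.time() - start)
        if server.can_proceed(wid):
            server.push(wid, gradient=1.0)
print("params:", server.params, "clocks:", dict(server.clock))
```

In this sketch the synchronization policy interpolates between bulk-synchronous behavior (staleness bound of 1) and a stale-synchronous style for slow nodes, which is one plausible way a per-node adaptive strategy could be realized.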