Abstract

To exploit the distributed nature of sensors, distributed machine learning has become the mainstream approach, but the differing computing capabilities of sensors and network delays greatly influence the accuracy and the convergence rate of the machine learning model. This paper describes a parameter communication optimization strategy that balances the training overhead and the communication overhead. We extend the fault tolerance of iterative-convergent machine learning algorithms and propose Dynamic Finite Fault Tolerance (DFFT). Based on DFFT, we implement a parameter communication optimization strategy for distributed machine learning, named the Dynamic Synchronous Parallel Strategy (DSP), which uses a performance monitoring model to dynamically adjust the parameter synchronization strategy between worker nodes and the Parameter Server (PS). This strategy makes full use of the computing power of each sensor, ensures the accuracy of the machine learning model, and prevents model training from being disturbed by tasks unrelated to the sensors.
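As a rough illustration of the kind of per-iteration synchronization decision such a strategy could make, the following Python sketch is offered; the function name, arguments, and the min(α, w) rule are assumptions for exposition, not the paper's implementation.

    # Hypothetical sketch: should a worker wait for fresh global parameters?
    def must_synchronize(local_clock, slowest_clock, alpha, w):
        # local_clock   -- iterations this worker has completed
        # slowest_clock -- iterations completed by the slowest worker
        # alpha         -- staleness bound from the performance monitoring model
        # w             -- weak threshold that alpha may never exceed
        effective_bound = min(alpha, w)   # keeps the fault tolerance finite
        return local_clock - slowest_clock > effective_bound

    # A worker at iteration 12 with the slowest worker at 7, alpha = 4, w = 6:
    # the gap of 5 exceeds min(4, 6), so the worker blocks and pulls from the PS.
    print(must_synchronize(12, 7, alpha=4, w=6))   # True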

Highlights

  • Sensor networks have important applications such as environmental monitoring, industrial machine monitoring, body area networks and military target tracking

  • The weak threshold w ensures that the fault tolerance is finite; α is calculated from the performance monitoring model and changes over the course of distributed training, which makes the fault tolerance dynamic (see the sketch after this list)

  • We improve the Stale Synchronous Parallel strategy and propose the Dynamic Synchronous Parallel Strategy (DSP), based on Dynamic Finite Fault Tolerance (DFFT)

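To make the second highlight concrete, here is a small, self-contained Python sketch of how a performance monitoring model might derive α from measured per-iteration times and clamp it by the weak threshold w; the ratio-based formula is an illustrative assumption, not the paper's equation.

    def update_alpha(iter_times, w):
        # iter_times: latest per-iteration wall-clock time of each worker (seconds)
        fastest = min(iter_times)
        slowest = max(iter_times)
        # Let fast workers run ahead roughly in proportion to the speed gap,
        # but never beyond the weak threshold w (finiteness) and never below 0.
        alpha = int(round(slowest / fastest)) - 1
        return max(0, min(alpha, w))

    # Three workers measured at 0.8 s, 1.0 s and 2.1 s per iteration, w = 4:
    print(update_alpha([0.8, 1.0, 2.1], w=4))   # 2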

Summary

Introduction

Sensor networks have important applications such as environmental monitoring, industrial machine monitoring, body area networks, and military target tracking. To solve the problems of the Bulk Synchronous Parallel (BSP) strategy, asynchronous iterative strategies for distributed machine learning have been proposed [12,13,14,15], in which worker nodes can start the next iteration using local model parameters before receiving the global model parameters. Under such strategies, the fault tolerance is increased, which can cause the machine learning model to fall into local optima, so the accuracy may not be guaranteed. DSP dynamically adjusts the parameter synchronization strategy between the worker nodes and the PS based on the performance monitoring model, which effectively reduces the influence of performance differences among worker nodes and ensures the accuracy as well as the convergence rate.
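For comparison, the sketch below spells out the assumed blocking behaviour of BSP, a fully asynchronous strategy (labelled ASP here), SSP with a fixed staleness threshold, and DSP with the dynamically adjusted, w-capped threshold described above; the semantics are paraphrased for illustration, not taken verbatim from the paper.

    def blocks_for_global_params(strategy, clock_gap, alpha=None, w=None):
        # clock_gap: iterations this worker is ahead of the slowest worker
        if strategy == "BSP":   # synchronize at every iteration boundary
            return clock_gap > 0
        if strategy == "ASP":   # fully asynchronous: never blocks
            return False
        if strategy == "SSP":   # blocks once a fixed staleness threshold is passed
            return clock_gap > alpha
        if strategy == "DSP":   # threshold adjusted at run time, capped by w
            return clock_gap > min(alpha, w)
        raise ValueError(strategy)

    for s in ("BSP", "ASP", "SSP", "DSP"):
        print(s, blocks_for_global_params(s, clock_gap=3, alpha=2, w=5))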

Related Work
The Distributed Machine Learning System
Parameter Server System
Robust Optimization
Communication Optimization
Theoretical Analysis
Problems of SSP
Improvements of Node
Experimental Environment
The Finiteness of the Fault Tolerance
Figure 11. This figure compares the accuracy of the model for each parameter.
The Dynamics of the Fault Tolerance
Conclusions

