Decentralized Distributed Deep Learning with Low-Bandwidth Consumption for Smart Constellations

Qingliang Meng,Yao Xu,Xueshuang Xiang,Meiyu Huang,Naijin Liu

doi:10.34133/2021/9879246

Qingliang Meng, Yao Xu + Show 3 more

Open Access

https://doi.org/10.34133/2021/9879246

Copy DOI

Abstract

For the space-based remote sensing system, onboard intelligent processing based on deep learning has become an inevitable trend. To adapt to the dynamic changes of the observation scenes, there is an urgent need to perform distributed deep learning onboard to fully utilize the plentiful real-time sensing data of multiple satellites from a smart constellation. However, the network bandwidth of the smart constellation is very limited. Therefore, it is of great significance to carry out distributed training research in a low-bandwidth environment. This paper proposes a Randomized Decentralized Parallel Stochastic Gradient Descent (RD-PSGD) method for distributed training in a low-bandwidth network. To reduce the communication cost, each node in RD-PSGD just randomly transfers part of the information of the local intelligent model to its neighborhood. We further speed up the algorithm by optimizing the programming of random index generation and parameter extraction. For the first time, we theoretically analyze the convergence property of the proposed RD-PSGD and validate the advantage of this method by simulation experiments on various distributed training tasks for image classification on different benchmark datasets and deep learning network architectures. The results show that RD-PSGD can effectively save the time and bandwidth cost of distributed training and reduce the complexity of parameter selection compared with the TopK-based method. The method proposed in this paper provides a new perspective for the study of onboard intelligent processing, especially for online learning on a smart satellite constellation.

Highlights

With the breakthrough development of artificial intelligence and the rapid improvement of onboard computing and storage capabilities, it is an inevitable trend for remote sensing satellite systems to directly generate information required by users through intelligent processing onboard [1, 2]
Depending on how the tasks are parallelized across satellites, the distributed training can be divided into two categories: model parallelism and data parallelism [3]
We prove the convergence of Randomized Decentralized Parallel Stochastic Gradient Descent (RD-PSGD)

Summary

Introduction

With the breakthrough development of artificial intelligence and the rapid improvement of onboard computing and storage capabilities, it is an inevitable trend for remote sensing satellite systems to directly generate information required by users through intelligent processing onboard [1, 2]. Due to the particularity of the operating environment of the satellites, which is different from the cluster system on the ground, the network bandwidth of the smart constellation is often very limited It is of great significance and practical urgency to develop distributed deep learning research under a low-bandwidth environment. The decentralized network structure removes the central parameter server and allows all nodes to exchange parameters or gradients with adjacent nodes In this way, the pressure of communication can be shared with each node to avoid congestion and improve the real-time capability of distributed training. A novel method named RD-PSGD (Randomized Decentralized Parallel Stochastic Gradient Descent) for reducing communication bandwidth by parameter sparsification is proposed.

Methodology

Programming Optimization

Experiments

Conclusion and Future Work

Findings

Conflicts of Interest

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Space: Science & Technology	Publication Date: Jan 1, 2021
Citations: 9	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Decentralized Distributed Deep Learning with Low-Bandwidth Consumption for Smart Constellations

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Space: Science & Technology

Lead the way for us

Similar Papers

Distributed Deep Reinforcement Learning: A Survey and a Multi-player Multi-agent Learning Toolbox
Qiyue Yin ... Shengqi Shen
Machine Intelligence Research | VOL. 21
Qiyue Yin, et. al.Qiyue Yin ... Shengqi Shen
11 Jan 2024
Machine Intelligence Research | VOL. 21

Instance segmentation on distributed deep learning big data cluster
Mohammed Elhmadany ... Hossam E Abdelmunim
Journal of Big Data | VOL. 11
Mohammed Elhmadany, et. al.Mohammed Elhmadany ... Hossam E Abdelmunim
02 Jan 2024
Journal of Big Data | VOL. 11

A review: Deep learning for medical image segmentation using multi-modality fusion
Tongxue Zhou ... Stéphane Canu
Array | VOL. 3-4
Tongxue Zhou, et. al.Tongxue Zhou ... Stéphane Canu
01 Sep 2019
Array | VOL. 3-4

Privacy-preserving distributed deep learning via LWE-based Certificateless Additively Homomorphic Encryption (CAHE)
Emmanuel Antwi-Boasiako ... Yingjie Dong
Journal of Information Security and Applications | VOL. 74
Emmanuel Antwi-Boasiako, et. al.Emmanuel Antwi-Boasiako ... Yingjie Dong
09 Mar 2023
Journal of Information Security and Applications | VOL. 74

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Decentralized Distributed Deep Learning with Low-Bandwidth Consumption for Smart Constellations

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Space: Science &amp; Technology

More From: Space: Science & Technology