Abstract

The generalization ability often determines the success of machine learning algorithms in practice. Therefore, it is of great theoretical and practical importance to understand and bound the generalization error of machine learning algorithms. In this paper, we provide the first generalization results for the popular stochastic gradient descent (SGD) algorithm in the asynchronous decentralized distributed setting. Our analysis is based on the uniform stability tool, where an algorithm is stable if the learned model does not change much under small perturbations of the training set. Under some mild assumptions, we perform a comprehensive generalization analysis of asynchronous decentralized SGD, including generalization error and excess generalization error bounds for the strongly convex, convex, and non-convex cases. Our theoretical results reveal the effects of the learning rate, training data size, number of training iterations, decentralized communication topology, and asynchronous delay on the generalization performance of asynchronous decentralized SGD. We also study the optimization error with respect to the objective function values and investigate how the initial point affects the excess generalization error. Finally, we conduct extensive experiments on the MNIST, CIFAR-10, CIFAR-100, and Tiny-ImageNet datasets to validate the theoretical findings.
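To make the stability tool and the algorithmic setting concrete, the sketch below states the standard notion of uniform stability (in the sense of Bousquet and Elisseeff, and of Hardt et al.) and a generic asynchronous decentralized SGD update. This is only an illustrative summary of textbook forms, not the paper's exact definitions: the symbols A(S), f, the mixing matrix W, the number of workers m, the learning rate \eta_t, the staleness \tau_i^t, and the sample \xi_i^t are illustrative notation introduced here, and the precise constants and update rule analyzed in the paper may differ.

```latex
% Uniform stability (standard textbook form; the paper's exact definition may differ):
% an algorithm A is \epsilon-uniformly stable if, for any two training sets S and S'
% of the same size that differ in a single example, and for every test point z,
\[
  \sup_{z}\; \mathbb{E}_{A}\bigl[\, f(A(S); z) \;-\; f(A(S'); z) \,\bigr] \;\le\; \epsilon_{\mathrm{stab}} .
\]
% A generic asynchronous decentralized SGD update for worker i, with a doubly
% stochastic mixing matrix W encoding the communication topology, learning rate
% \eta_t, asynchronous delay (staleness) \tau_i^t, and stochastic sample \xi_i^t
% (all notation assumed for illustration):
\[
  x_i^{t+1} \;=\; \sum_{j=1}^{m} W_{ij}\, x_j^{t} \;-\; \eta_t\, \nabla f\bigl(x_i^{\,t-\tau_i^t};\, \xi_i^t\bigr), \qquad i = 1, \dots, m .
\]
```

Under this kind of update, the quantities the abstract highlights (learning rate, topology via W, and delay \tau) all enter the recursion directly, which is why they appear in the stability and generalization bounds.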
