Byzantine-robust decentralized stochastic optimization with stochastic gradient noise-independent learning error

Jie Peng,Weiyu Li,Qing Ling

doi:10.1016/j.sigpro.2024.109419

Abstract

This paper studies Byzantine-robust stochastic optimization over a decentralized network, where every agent periodically communicates with its neighbors to exchange local models, and then updates its own local model with one or a mini-batch of local samples. The performance of such a method is affected by an unknown number of Byzantine agents, which conduct adversarially during the optimization process. To the best of our knowledge, there is no existing work that simultaneously achieves a linear convergence speed and a small learning error. We observe that the unsatisfactory trade-off between convergence speed and learning error is due to the intrinsic stochastic gradient noise. Motivated by this observation, we introduce two variance reduction methods, stochastic average gradient algorithm (SAGA) and loopless stochastic variance-reduced gradient (LSVRG), to Byzantine-robust decentralized stochastic optimization for eliminating the negative effect of the stochastic gradient noise. The two resulting methods, BRAVO-SAGA and BRAVO-LSVRG, enjoy both linear convergence speeds and stochastic gradient noise-independent learning errors. Such learning errors are optimal for a class of methods based on total variation (TV)-norm regularization and stochastic subgradient update. We conduct extensive numerical experiments to show their effectiveness under various Byzantine attacks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Byzantine-robust decentralized stochastic optimization with stochastic gradient noise-independent learning error

Abstract

Talk to us

Similar Papers

More From: Signal Processing

Lead the way for us

Similar Papers

Optimized convergence of stochastic gradient descent by weighted averaging
Melinda Hagedorn ... Florian Jarre
Optimization Methods and Software | VOL. ahead-of-print
Melinda Hagedorn, et. al.Melinda Hagedorn ... Florian Jarre
03 Feb 2024
Optimization Methods and Software | VOL. ahead-of-print

A Stochastic Gradient Method With Mesh Refinement for PDE-Constrained Optimization Under Uncertainty
Caroline Geiersbach ... Winnifried Wollner
SIAM Journal on Scientific Computing | VOL. 42
Caroline Geiersbach, et. al.Caroline Geiersbach ... Winnifried Wollner
01 Jan 2020
SIAM Journal on Scientific Computing | VOL. 42

Re-use of samples in stochastic annealing
Robin Ball ... Stephan Meisel
Computers and Operations Research | VOL. 164
Robin Ball, et. al.Robin Ball ... Stephan Meisel
14 Jan 2024
Computers and Operations Research | VOL. 164

Asynchronous stochastic convex optimization over random networks: Error bounds
B Touri ... A Nedic
-
B Touri, et. al.B Touri ... A Nedic
01 Jan 2009
01 Jan 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Byzantine-robust decentralized stochastic optimization with stochastic gradient noise-independent learning error

Abstract

Talk to us

Similar Papers

More From: Signal Processing