Improved Step-Size Schedules for Proximal Noisy Gradient Methods

Sarit Khirirat, Sindri Magnússon, Mikael Johansson, Xiaoyu Wang

doi:10.1109/tsp.2023.3237392

Abstract

Noisy gradient algorithms have emerged as some of the most popular methods for distributed optimization with massive data. Choosing a proper step-size schedule is an important part of tuning these algorithms for good performance. To attain fast convergence and high accuracy, it is intuitive to use large step-sizes in the initial iterations, when the gradient noise is typically small compared to the algorithm steps, and to reduce the step-sizes as the algorithm progresses. This intuition has been confirmed in theory and practice for stochastic gradient descent. However, similar results are lacking for other methods that use approximate gradients. This paper shows that diminishing step-size strategies can indeed be applied to a broad class of noisy gradient algorithms. Our analysis framework is based on two classes of systems that characterize the impact of the step-sizes on the convergence performance of many algorithms. Our results show that such step-size schedules enable these algorithms to enjoy the optimal rate. We exemplify our results on stochastic compression algorithms. Our experiments validate the fast convergence of these algorithms with step decay schedules.
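
To make the step decay idea concrete, the following is a minimal sketch of a proximal method driven by noisy gradients. It is not the paper's algorithm or its experimental setup. The one-dimensional test problem, the Gaussian noise model, the initial step-size `gamma0`, the decay `factor`, and the stage length `stage_len` are all assumptions chosen for illustration. The step-size is held constant within a stage and multiplied by `factor` between stages, so early iterations take large steps and later iterations take small ones.

```python
import math
import random


def step_decay(t, gamma0=0.5, factor=0.5, stage_len=200):
    """Hypothetical step decay schedule: constant within a stage,
    multiplied by `factor` each time a new stage of `stage_len` iterations starts."""
    return gamma0 * factor ** (t // stage_len)


def soft_threshold(v, tau):
    """Proximal operator of tau * |x| (scalar soft-thresholding)."""
    return math.copysign(max(abs(v) - tau, 0.0), v)


def prox_noisy_gradient(a=3.0, lam=1.0, sigma=1.0, iters=2000,
                        schedule=step_decay, seed=0):
    """Minimise 0.5 * (x - a)**2 + lam * |x| using gradients of the smooth
    part that are corrupted by zero-mean Gaussian noise (an assumed noise model)."""
    rng = random.Random(seed)
    x = 0.0
    for t in range(iters):
        gamma = schedule(t)
        noisy_grad = (x - a) + rng.gauss(0.0, sigma)
        # Gradient step on the smooth term, then proximal step on lam * |x|.
        x = soft_threshold(x - gamma * noisy_grad, gamma * lam)
    return x


if __name__ == "__main__":
    # For a = 3 and lam = 1 the exact minimiser is soft_threshold(3, 1) = 2.
    x_star = soft_threshold(3.0, 1.0)
    x_decay = prox_noisy_gradient()
    x_const = prox_noisy_gradient(schedule=lambda t: 0.5)
    print("error with step decay:   ", abs(x_decay - x_star))
    print("error with constant step:", abs(x_const - x_star))
```

Running the script prints the final distance to the minimiser under the step decay schedule and under a constant step-size equal to `gamma0`, which gives a quick way to compare the two on this toy problem.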

Journal: IEEE Transactions on Signal Processing	Publication Date: Jan 1, 2023
Citations: 6

Similar Papers

Improved Step-Size Schedules for Noisy Gradient Methods
Sarit Khirirat ... Sindri Magnusson
06 Jun 2021

On the Convergence Properties of a K-step Averaging Stochastic Gradient Descent Algorithm for Nonconvex Optimization
Fan Zhou ... Guojing Cong
01 Jul 2018

Variance Reduced Stochastic Proximal Algorithm for AUC Maximization
Soham Dan ... Dushyant Sahoo
01 Jan 2020

Adaptive Bayes-Adam MIMO Equalizer With High Accuracy and Fast Convergence for Orbital Angular Momentum Mode Division Multiplexed Transmission
Sihan Wang ... Han Zhang
Journal of Lightwave Technology | VOL. 41
01 Aug 2023
