Stochastic gradient descent, weighted sampling, and the randomized Kaczmarz algorithm

Deanna Needell,Nathan Srebro,Rachel Ward

doi:10.1007/s10107-015-0864-7

Abstract

We obtain an improved finite-sample guarantee on the linear convergence of stochastic gradient descent for smooth and strongly convex objectives, improving from a quadratic dependence on the conditioning $$(L/\mu )^2$$(L/μ)2 (where $$L$$L is a bound on the smoothness and $$\mu $$μ on the strong convexity) to a linear dependence on $$L/\mu $$L/μ. Furthermore, we show how reweighting the sampling distribution (i.e. importance sampling) is necessary in order to further improve convergence, and obtain a linear dependence in the average smoothness, dominating previous results. We also discuss importance sampling for SGD more broadly and show how it can improve convergence also in other scenarios. Our results are based on a connection we make between SGD and the randomized Kaczmarz algorithm, which allows us to transfer ideas between the separate bodies of literature studying each of the two methods. In particular, we recast the randomized Kaczmarz algorithm as an instance of SGD, and apply our results to prove its exponential convergence, but to the solution of a weighted least squares problem rather than the original least squares problem. We then present a modified Kaczmarz algorithm with partially biased sampling which does converge to the original least squares solution with the same exponential convergence rate.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Stochastic gradient descent, weighted sampling, and the randomized Kaczmarz algorithm

Abstract

Talk to us

Similar Papers

More From: Mathematical Programming

Lead the way for us

Journal: Mathematical Programming	Publication Date: Feb 3, 2015
Citations: 378

Similar Papers

Primal Averaging: A New Gradient Evaluation Step to Attain the Optimal Individual Convergence.
Wei Tao ... Qing Tao
IEEE Transactions on Cybernetics | VOL. 50
Wei Tao, et. al.Wei Tao ... Qing Tao
19 Oct 2018
IEEE Transactions on Cybernetics | VOL. 50

Randomized Kaczmarz algorithm for inconsistent linear systems: An exact MSE analysis
Chuang Wang ... Ameya Agaskar
-
Chuang Wang, et. al.Chuang Wang ... Ameya Agaskar
01 May 2015
01 May 2015

Kaczmarz Iterative Projection and Nonuniform Sampling with Complexity Estimates.
Tim Wallace ... Ali Sekmen
Journal of medical engineering | VOL. 2014
Tim Wallace, et. al.Tim Wallace ... Ali Sekmen
14 Dec 2014
Journal of medical engineering | VOL. 2014

Neighborhood Systems Priority Identification and Randomized Kaczmarz Algorithm
A.M Shmyrin ... E.P Trofimov
-
A.M Shmyrin, et. al.A.M Shmyrin ... E.P Trofimov
01 Sep 2018
01 Sep 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Stochastic gradient descent, weighted sampling, and the randomized Kaczmarz algorithm

Abstract

Talk to us

Similar Papers

More From: Mathematical Programming