Non-asymptotic convergence analysis of inexact gradient methods for machine learning without strong convexity

Anthony Man-Cho So, Zirui Zhou

doi:10.1080/10556788.2017.1296439

Abstract

Many recent applications in machine learning and data fitting call for the algorithmic solution of structured smooth convex optimization problems. Although the gradient descent method is a natural choice for this task, it requires exact gradient computations and hence can be inefficient when the problem size is large or the gradient is difficult to evaluate. Therefore, there has been much interest in inexact gradient methods (IGMs), in which an efficiently computable approximate gradient is used to perform the update in each iteration. Currently, non-asymptotic linear convergence results for IGMs are typically established under the assumption that the objective function is strongly convex, which is not satisfied in many applications of interest; while linear convergence results that do not require the strong convexity assumption are usually asymptotic in nature. In this paper, we combine the best of these two types of results by developing a framework for analysing the non-asymptotic convergence rates of IGMs when they are applied to a class of structured convex optimization problems that includes least squares regression and logistic regression. We then demonstrate the power of our framework by proving, in a unified manner, new linear convergence results for three recently proposed algorithms: the incremental gradient method with increasing sample size [R.H. Byrd, G.M. Chin, J. Nocedal, and Y. Wu, Sample size selection in optimization methods for machine learning, Math. Program. Ser. B 134 (2012), pp. 127–155; M.P. Friedlander and M. Schmidt, Hybrid deterministic–stochastic methods for data fitting, SIAM J. Sci. Comput. 34 (2012), pp. A1380–A1405], the stochastic variance-reduced gradient (SVRG) method [R. Johnson and T. Zhang, Accelerating stochastic gradient descent using predictive variance reduction, Advances in Neural Information Processing Systems 26: Proceedings of the 2013 Conference, 2013, pp. 315–323], and the incremental aggregated gradient (IAG) method [D. Blatt, A.O. Hero, and H. Gauchman, A convergent incremental gradient method with a constant step size, SIAM J. Optim. 18 (2007), pp. 29–51]. We believe that our techniques will find further applications in the non-asymptotic convergence analysis of other first-order methods.
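To make the idea of an inexact gradient method concrete, the following is a minimal sketch of an SVRG-style update on a small least squares problem. It is an illustration under stated assumptions, not the authors' implementation: the function names, the data, the step size, the epoch and inner-loop counts, and the choice of the last inner iterate as the next reference point are all hypothetical. The example has two data points in three dimensions, so the Hessian is singular and the objective is convex but not strongly convex.

```python
import random


def dot(u, v):
    """Inner product of two equal-length lists."""
    return sum(ui * vi for ui, vi in zip(u, v))


def grad_i(x, a_i, b_i):
    """Gradient of the single term f_i(x) = 0.5 * (a_i . x - b_i)^2."""
    residual = dot(a_i, x) - b_i
    return [residual * a for a in a_i]


def full_grad(x, A, b):
    """Exact gradient of f(x) = (1/n) * sum_i f_i(x)."""
    n = len(A)
    g = [0.0] * len(x)
    for a_i, b_i in zip(A, b):
        g = [gj + gij / n for gj, gij in zip(g, grad_i(x, a_i, b_i))]
    return g


def objective(x, A, b):
    """Least squares objective f(x) = (1/n) * sum_i 0.5 * (a_i . x - b_i)^2."""
    return sum(0.5 * (dot(a_i, x) - b_i) ** 2 for a_i, b_i in zip(A, b)) / len(A)


def svrg(A, b, x0, step, epochs, inner_steps, seed=0):
    """SVRG-style sketch: each inner step uses an approximate gradient
    built from one sampled term plus a full gradient at a reference point.
    Assumption: the last inner iterate becomes the next reference point."""
    rng = random.Random(seed)
    n = len(A)
    x_ref = list(x0)
    for _ in range(epochs):
        mu = full_grad(x_ref, A, b)  # exact gradient, computed once per epoch
        x = list(x_ref)
        for _ in range(inner_steps):
            i = rng.randrange(n)
            g_x = grad_i(x, A[i], b[i])
            g_ref = grad_i(x_ref, A[i], b[i])
            # Variance-reduced approximate gradient at x.
            g = [gx - gr + m for gx, gr, m in zip(g_x, g_ref, mu)]
            x = [xj - step * gj for xj, gj in zip(x, g)]
        x_ref = x
    return x_ref


# Illustrative data (hypothetical): 2 equations, 3 unknowns, so the
# minimizer is not unique and f is not strongly convex.
A = [[1.0, 0.0, 1.0], [0.0, 1.0, 1.0]]
b = [1.0, 2.0]
x0 = [0.0, 0.0, 0.0]
x_hat = svrg(A, b, x0, step=0.1, epochs=40, inner_steps=100)
print(objective(x0, A, b), objective(x_hat, A, b))
```

Only one sampled term is differentiated per inner step, while the full gradient is recomputed once per epoch; that trade-off is what makes the per-iteration gradient inexact but cheap.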

Journal: Optimization Methods and Software
Publication Date: May 31, 2017
Citations: 56

Similar Papers

An Incremental Clustered Gradient Method for Wireless Sensor Networks
Anil Mahmud ... Md Shopon
01 Apr 2018

Convergence Rate of Incremental Gradient and Incremental Newton Methods
M Gürbüzbalaban ... P A Parrilo
SIAM Journal on Optimization | VOL. 29
01 Jan 2019

A Convergent Incremental Gradient Method with a Constant Step Size
Doron Blatt ... Alfred O Hero
SIAM Journal on Optimization | VOL. 18
01 Jan 2007

Linear convergence of cyclic SAGA
Youngsuk Park ... Ernest K Ryu
Optimization Letters | VOL. 14
04 Jan 2020
