Abstract

In this paper, we present new second-order methods with convergence rate $O\left(k^{-4}\right)$, where $k$ is the iteration counter. This is faster than the existing lower bound for this type of schemes (Agarwal and Hazan in Proceedings of the 31st conference on learning theory, PMLR, pp. 774–792, 2018; Arjevani and Shiff in Math Program 178(1–2):327–360, 2019), which is $O\left(k^{-7/2}\right)$. Our progress can be explained by a finer specification of the problem class. The main idea of this approach consists in implementation of the third-order scheme from Nesterov (Math Program 186:157–183, 2021) using the second-order oracle. At each iteration of our method, we solve a nontrivial auxiliary problem by a linearly convergent scheme based on the relative non-degeneracy condition (Bauschke et al. in Math Oper Res 42:330–348, 2016; Lu et al. in SIOPT 28(1):333–354, 2018). During this process, the Hessian of the objective function is computed once, and the gradient is computed $O\left(\ln \frac{1}{\epsilon}\right)$ times, where $\epsilon$ is the desired accuracy of the solution for our problem.
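
To make the cost pattern described above concrete, here is a minimal sketch (not the paper's actual algorithm) of a Bregman-type gradient method that converges linearly when the auxiliary objective is smooth and strongly convex relative to a fixed quadratic reference function $\rho(y)=\frac{1}{2}\langle Ay,y\rangle$. The function names, the quadratic reference, and the stopping rule are illustrative assumptions; in the paper the reference function and the auxiliary problem have a specific, more involved structure. The sketch only shows the per-iteration accounting: the matrix $A$ (e.g., a Hessian) is fixed, while the gradient is called once per iteration, and linear convergence keeps the number of gradient calls at $O\left(\ln \frac{1}{\epsilon}\right)$.

    import numpy as np

    def bregman_gradient(grad, A, x0, L_rel, mu_rel, eps, max_iter=10000):
        # Hypothetical sketch: minimize phi, assuming phi is L_rel-smooth and
        # mu_rel-strongly convex relative to rho(y) = 0.5 * y^T A y, where A is
        # a fixed positive-definite matrix (e.g. a Hessian computed once).
        # One Bregman step: x+ = argmin_y <grad(x), y> + L_rel * D_rho(y, x)
        #                      = x - (1/L_rel) * A^{-1} grad(x).
        x = np.asarray(x0, dtype=float).copy()
        for k in range(max_iter):
            g = grad(x)                              # one gradient call per iteration
            x_next = x - np.linalg.solve(A, g) / L_rel
            # Under relative strong convexity the objective gap contracts by a
            # factor (1 - mu_rel / L_rel) per step, hence O(ln(1/eps)) iterations.
            if np.linalg.norm(x_next - x) <= eps:
                return x_next, k + 1
            x = x_next
        return x, max_iter

With $A$ taken as a Hessian evaluated once at the current outer point, every inner step costs one gradient call plus a linear solve against the same matrix, which is consistent with the per-iteration accounting in the abstract.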

Highlights

  • In recent years, the theory of high-order methods in convex optimization has been developed seemingly up to its natural limits

  • Since the auxiliary problem in tensor methods can be posed as a problem of minimizing a convex multivariate polynomial [15], the performance of these methods was very soon increased up to the maximal limits [6,7,9], given by the theoretical lower complexity bounds [1,2]

  • We conclude that the existing classification of the problem classes, optimization schemes, and complexity bounds is not perfect


Summary


The auxiliary problem in tensor methods can be posed as a problem of minimizing a convex multivariate polynomial [15], so very soon the performance of these methods was increased up to the maximal limits [6,7,9], given by the theoretical lower complexity bounds [1,2]. In Sect. 3, we analyze the rate of convergence of the gradient method based on the relative smoothness condition [4,10], under the assumption that the gradient of the objective function is computed with a small absolute error. We need this analysis for replacing the exact value of the third derivative along two vectors by a finite difference of the gradients. In the resulting scheme, the Hessian is computed once and the gradient is computed $O\left(\ln \frac{1}{\epsilon}\right)$ times, where $\epsilon$ is the desired accuracy of the solution of the main problem. Recall that this rate of convergence is impossible for second-order schemes working with functions with Lipschitz-continuous third derivative (see [1,2]).
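
As an illustration of the finite-difference replacement mentioned above, one standard way to approximate the third directional derivative $D^3 f(x)[h,h]$ using only gradient values is the central difference (the concrete operator and step-size choice in the paper may differ):

$$
D^3 f(x)[h,h] \;\approx\; \frac{1}{\tau^2}\Big(\nabla f(x+\tau h) + \nabla f(x-\tau h) - 2\nabla f(x)\Big),
$$

whose error is of order $\tau$ (up to factors depending on $\Vert h \Vert$) when the third derivative is Lipschitz continuous, so $\tau$ can be tied to the accuracy required by the inexact-gradient analysis.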

Tensor Methods with Inexact Iteration
Relative Non-degeneracy and Approximate Gradients
Second-Order Implementations of the Third-Order Methods
Bounds for the Derivatives
Conclusion