Abstract
Optimization algorithms can see their local convergence rates deteriorate when the Hessian at the optimum is singular. These singularities are inescapable when the optima are non-isolated. Yet, under the right circumstances, several algorithms preserve their favorable rates even when optima form a continuum (e.g., due to over-parameterization). This has been explained under various structural assumptions, including the Polyak–Łojasiewicz condition, Quadratic Growth, and the Error Bound. We show that, for cost functions which are twice continuously differentiable ($$\textrm{C}^2$$), these three (local) properties are equivalent. Moreover, we show they are equivalent to the Morse–Bott property, that is, local minima form differentiable submanifolds and the Hessian of the cost function is positive definite along their normal directions. We leverage this insight to improve local convergence guarantees for safeguarded Newton-type methods under any (hence all) of the above assumptions. First, for adaptive cubic regularization, we secure quadratic convergence even with inexact subproblem solvers. Second, for trust-region methods, we argue capture can fail with an exact subproblem solver, then proceed to show linear convergence with an inexact one (Cauchy steps).
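For orientation, the three conditions are typically stated locally on a neighborhood of the set of minimizers $$\mathcal{X}^*$$ with optimal value $$f^*$$; the formulation below is a standard one and not quoted from the paper, and the constant $$\mu > 0$$ and the neighborhood may differ across conditions:

$$\text{(PL)}\ \ \tfrac{1}{2}\,\|\nabla f(x)\|^2 \ge \mu\,\bigl(f(x) - f^*\bigr), \qquad \text{(QG)}\ \ f(x) - f^* \ge \tfrac{\mu}{2}\,\operatorname{dist}\bigl(x, \mathcal{X}^*\bigr)^2, \qquad \text{(EB)}\ \ \|\nabla f(x)\| \ge \mu\,\operatorname{dist}\bigl(x, \mathcal{X}^*\bigr).$$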