Abstract

We focus on minimizing nonconvex finite-sum functions that typically arise in machine learning problems. For this problem, the adaptive cubic-regularized Newton method offers strong global convergence guarantees and the ability to escape strict saddle points. In this paper, we extend this algorithm by incorporating the negative curvature method so that updates are made even at unsuccessful iterations. We call this new method Stochastic Adaptive cubic regularization with Negative Curvature (SANC). Unlike previous methods, the SANC algorithm uses independent sets of data points of consistent size over all iterations to obtain stochastic gradient and Hessian estimators. This makes the SANC algorithm more practical for solving large-scale machine learning problems. To the best of our knowledge, this is the first approach that combines the negative curvature method with the adaptive cubic-regularized Newton method. Finally, we provide experimental results, including neural network problems, that support the efficiency of our method.
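
The following is a minimal, illustrative sketch of the kind of iteration the abstract describes: mini-batch gradient and Hessian estimates drawn from independent, fixed-size samples, an approximate solve of the cubic-regularized subproblem, and a negative-curvature step taken at unsuccessful iterations. The toy objective, the gradient-descent subsolver, the eigendecomposition-based negative-curvature direction, and all parameter update rules (eta, gamma, alpha) are assumptions made here for illustration only; they are not the paper's exact algorithm, acceptance test, or subsolver.

```python
import numpy as np

# Hypothetical sketch of a SANC-style loop on a toy nonconvex finite-sum problem.
rng = np.random.default_rng(0)
n, d = 200, 5
A = rng.normal(size=(n, d))
b = rng.normal(size=n)

def f_batch(x, idx):
    # Nonconvex finite-sum: bounded, smooth per-sample loss r^2 / (1 + r^2).
    r = A[idx] @ x - b[idx]
    return np.mean(r**2 / (1.0 + r**2))

def grad_batch(x, idx):
    r = A[idx] @ x - b[idx]
    w = 2.0 * r / (1.0 + r**2) ** 2              # derivative of r^2/(1+r^2) w.r.t. r
    return A[idx].T @ w / len(idx)

def hess_batch(x, idx):
    r = A[idx] @ x - b[idx]
    w = (2.0 - 6.0 * r**2) / (1.0 + r**2) ** 3   # second derivative w.r.t. r
    return (A[idx].T * w) @ A[idx] / len(idx)

def cubic_model(g, H, sigma, s):
    # Local model m(s) = g^T s + 0.5 s^T H s + (sigma/3) ||s||^3.
    return g @ s + 0.5 * s @ H @ s + (sigma / 3.0) * np.linalg.norm(s) ** 3

def solve_cubic_subproblem(g, H, sigma, iters=200, lr=1e-2):
    # Plain gradient descent on the cubic model (a stand-in for Lanczos-type subsolvers).
    s = np.zeros_like(g)
    for _ in range(iters):
        s -= lr * (g + H @ s + sigma * np.linalg.norm(s) * s)
    return s

x = rng.normal(size=d)
sigma, batch_size = 1.0, 32
eta, gamma, alpha = 0.1, 2.0, 0.1                # illustrative acceptance / update constants

for t in range(20):
    # Independent, fixed-size mini-batches for the gradient and Hessian estimates.
    gi = rng.choice(n, size=batch_size, replace=False)
    hi = rng.choice(n, size=batch_size, replace=False)
    g, H = grad_batch(x, gi), hess_batch(x, hi)

    s = solve_cubic_subproblem(g, H, sigma)
    pred = -cubic_model(g, H, sigma, s)          # predicted model decrease
    act = f_batch(x, gi) - f_batch(x + s, gi)    # sampled actual decrease

    if pred > 0 and act >= eta * pred:
        x, sigma = x + s, max(sigma / gamma, 1e-3)   # successful: accept step, relax sigma
    else:
        sigma *= gamma                               # unsuccessful: inflate regularization
        lam, V = np.linalg.eigh(H)
        if lam[0] < 0:                               # negative curvature available:
            v = V[:, 0]                              # still update along that direction
            v = -v if g @ v > 0 else v               # orient it as a descent direction
            x = x + alpha * abs(lam[0]) * v
    print(f"iter {t:2d}  f(full)={f_batch(x, np.arange(n)):.4f}  sigma={sigma:.3f}")
```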
