Abstract

We present two sampled quasi-Newton methods (sampled LBFGS and sampled LSR1) for solving empirical risk minimization problems that arise in machine learning. In contrast to the classical variants of these methods, which sequentially build Hessian or inverse-Hessian approximations as the optimization progresses, our proposed methods sample points randomly around the current iterate at every iteration to produce these approximations. As a result, the constructed approximations use more reliable (recent and local) information and do not depend on past iterate information that may be significantly stale. Our proposed algorithms are efficient in terms of accessed data points (epochs) and have enough concurrency to take advantage of parallel/distributed computing environments. We provide convergence guarantees for our proposed methods. Numerical tests on a toy classification problem, as well as on popular benchmark binary classification and neural network training tasks, show that the methods outperform their classical variants.
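The following is a minimal sketch of the sampling idea described above, not the paper's exact algorithm: at the current iterate, fresh curvature pairs (s, y) are formed from randomly sampled nearby points via gradient differences, then consumed by the standard L-BFGS two-loop recursion. The function names (`sample_curvature_pairs`, `grad_fn`), the sampling radius, and the gradient-difference construction of the pairs are illustrative assumptions; the paper's method may build the pairs differently (e.g., via Hessian-vector products).

```python
import numpy as np

def sample_curvature_pairs(w, grad_fn, m=10, radius=1e-2, rng=None):
    """Build m fresh (s, y) curvature pairs by sampling points around the
    current iterate w, rather than reusing pairs from past iterations
    (the sampled quasi-Newton idea).  grad_fn(w) returns the gradient of
    the empirical risk at w.  All parameter choices here are illustrative."""
    rng = np.random.default_rng() if rng is None else rng
    g = grad_fn(w)
    S, Y = [], []
    for _ in range(m):
        s = radius * rng.standard_normal(w.shape)   # random step near w
        y = grad_fn(w + s) - g                      # finite-difference curvature
        # keep only pairs with sufficient positive curvature (s'y > 0)
        if s @ y > 1e-10 * np.linalg.norm(s) * np.linalg.norm(y):
            S.append(s)
            Y.append(y)
    return S, Y

def two_loop_recursion(g, S, Y):
    """Standard L-BFGS two-loop recursion: returns an approximation of
    H^{-1} g built from the sampled (s, y) pairs."""
    if not S:                    # no acceptable pairs: fall back to the gradient
        return g.copy()
    q = g.copy()
    alphas, rhos = [], []
    for s, y in zip(reversed(S), reversed(Y)):      # newest pair first
        rho = 1.0 / (y @ s)
        alpha = rho * (s @ q)
        q -= alpha * y
        alphas.append(alpha)
        rhos.append(rho)
    # common initial scaling H0 = (s'y / y'y) * I
    gamma = (S[-1] @ Y[-1]) / (Y[-1] @ Y[-1])
    r = gamma * q
    for (s, y), alpha, rho in zip(zip(S, Y), reversed(alphas), reversed(rhos)):
        beta = rho * (y @ r)
        r += (alpha - beta) * s
    return r                     # approximate Newton direction H^{-1} g
```

In this sketch the pairs are discarded and resampled at every iteration, so the resulting approximation reflects only curvature local to the current iterate, which is the property the abstract contrasts with history-based classical variants.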
