Sample size selection in optimization methods for machine learning

Richard H. Byrd, Gillian M. Chin, Jorge Nocedal, Yuchen Wu

doi:10.1007/s10107-012-0572-5

Abstract

This paper presents a methodology for using varying sample sizes in batch-type optimization methods for large-scale machine learning problems. The first part of the paper deals with the delicate issue of dynamic sample selection in the evaluation of the function and gradient. We propose a criterion for increasing the sample size based on variance estimates obtained during the computation of a batch gradient. We establish an $O(1/\epsilon)$ complexity bound on the total cost of a gradient method. The second part of the paper describes a practical Newton method that uses a smaller sample to compute Hessian-vector products than to evaluate the function and the gradient, and that also employs a dynamic sampling technique. The third part of the paper shifts the focus to $L_1$-regularized problems designed to produce sparse solutions. We propose a Newton-like method that consists of two phases: a (minimalistic) gradient projection phase that identifies zero variables, and a subspace phase that applies a subsampled Hessian Newton iteration in the free variables. Numerical tests on speech recognition problems illustrate the performance of the algorithms.
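
The variance-based criterion for increasing the sample size can be illustrated with a small sketch. The abstract does not give the exact test, so everything below is an assumption: the function name `variance_test`, the tolerance `theta`, and the specific form of the check (the estimated variance of the batch gradient is compared against `theta**2` times the squared norm of the batch gradient, and a larger sample is suggested when the check fails). It is meant as one plausible reading of such a criterion, not as the authors' implementation.

```python
import math


def variance_test(per_example_grads, theta=0.5):
    """Check whether the current sample looks large enough.

    per_example_grads: list of equal-length lists, one gradient per example
                       (at least two examples are needed for the variance).
    theta: hypothetical tolerance; its value is not taken from the abstract.
    Returns (passes, suggested_sample_size).
    """
    n = len(per_example_grads)
    dim = len(per_example_grads[0])

    # Batch gradient: component-wise mean of the per-example gradients.
    mean = [sum(g[j] for g in per_example_grads) / n for j in range(dim)]

    # Unbiased sample variance of each component, summed over components.
    var_sum = sum(
        sum((g[j] - mean[j]) ** 2 for g in per_example_grads) / (n - 1)
        for j in range(dim)
    )
    grad_sq = sum(m * m for m in mean)

    # Assumed test: variance of the batch gradient estimate (var_sum / n)
    # must be small relative to the squared norm of the batch gradient.
    if var_sum / n <= theta ** 2 * grad_sq:
        return True, n

    # Test failed: suggest a sample size that would satisfy the test if the
    # variance and gradient norm stayed the same (assumed heuristic).
    if grad_sq == 0.0:
        return False, 2 * n  # arbitrary fallback when the gradient vanishes
    suggested = math.ceil(var_sum / (theta ** 2 * grad_sq))
    return False, max(suggested, n + 1)


# Identical gradients have zero variance, so the sample is kept as is.
ok, size = variance_test([[1.0, 0.0], [1.0, 0.0], [1.0, 0.0]])

# Gradients that disagree strongly trigger a request for a larger sample.
ok_noisy, size_noisy = variance_test([[1.0], [-1.0], [1.0], [-1.0], [2.0]])
```

In an optimization loop, a check of this kind would run after each batch gradient is computed, and the sample would grow only when the gradient estimate is judged too noisy to give a reliable descent direction.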

Journal: Mathematical Programming
Publication Date: Jun 24, 2012
Citations: 388