Abstract
When randomized ensembles such as bagging or random forests are used for binary classification, the prediction error of the ensemble tends to decrease and stabilize as the number of classifiers increases. However, the precise relationship between prediction error and ensemble size is unknown in practice. In the standard case when classifiers are aggregated by majority vote, the present work offers a way to quantify this convergence in terms of "algorithmic variance," i.e., the variance of prediction error due only to the randomized training algorithm. Specifically, we study a theoretical upper bound on this variance, and show that it is sharp, in the sense that it is attained by a specific family of randomized classifiers. Next, we address the problem of estimating the unknown value of the bound, which leads to a unique twist on the classical problem of non-parametric density estimation. In particular, we develop an estimator for the bound and show that its mean-squared error (MSE) matches optimal non-parametric rates under certain conditions. (Concurrent with this work, some closely related results have also been considered in Cannings and Samworth (2017) and Lopes (2019).)
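To make the notion of algorithmic variance concrete, the following minimal sketch (not from the paper) holds the training and test data fixed and re-runs a randomized bagging ensemble under different seeds, so that the only source of variation is the training algorithm itself. It assumes scikit-learn; the dataset, ensemble sizes, and number of seeds are illustrative choices, and `BaggingClassifier` aggregates by averaging predicted class probabilities, which coincides with a majority vote when fully grown trees emit hard 0/1 predictions.

```python
# Illustrative sketch of "algorithmic variance": the variance of a
# randomized ensemble's test error across runs of the training algorithm,
# with the data held fixed. All parameter choices here are assumptions.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Fixed synthetic binary classification problem.
X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.5, random_state=0)

for t in [10, 50, 200]:  # ensemble sizes
    errs = []
    for seed in range(20):  # vary only the algorithmic randomness
        clf = BaggingClassifier(DecisionTreeClassifier(), n_estimators=t,
                                random_state=seed).fit(X_tr, y_tr)
        errs.append(np.mean(clf.predict(X_te) != y_te))  # test error
    # The variance across seeds is an empirical algorithmic variance;
    # it should shrink as the ensemble size t grows.
    print(f"t={t:4d}  mean error={np.mean(errs):.4f}  alg. var={np.var(errs):.2e}")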