Boosting the margin: a new explanation for the effectiveness of voting methods

Peter Bartlett,Robert E Schapire,Yoav Freund,Wee Sun Lee

doi:10.1214/aos/1024691352

Abstract

One of the surprising recurring phenomena observed in experiments with boosting is that the test error of the generated classifier usually does not increase as its size becomes very large, and often is observed to decrease even after the training error reaches zero. In this paper, we show that this phenomenon is related to the distribution of margins of the training examples with respect to the generated voting classification rule, where the margin of an example is simply the difference between the number of correct votes and the maximum number of votes received by any incorrect label. We show that techniques used in the analysis of Vapnik’s support vector classifiers and of neural networks with small weights can be applied to voting methods to relate the margin distribution to the test error. We also show theoretically and experimentally that boosting is especially effective at increasing the margins of the training examples. Finally, we compare our explanation to those based on the bias-variance decomposition.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Boosting the margin: a new explanation for the effectiveness of voting methods

Abstract

Talk to us

Similar Papers

More From: The Annals of Statistics

Lead the way for us

Journal: The Annals of Statistics	Publication Date: Oct 1, 1998
Citations: 2082

Similar Papers

Good results from sensor data: Performance of machine learning algorithms for regression problems in chemical sensors
Lajos Höfler
Sensors and Actuators: B. Chemical | VOL. 421
Lajos HöflerLajos Höfler
29 Aug 2024
Sensors and Actuators: B. Chemical | VOL. 421

Influence of Varying Training Set Composition and Size on Support Vector Machine-Based Prediction of Active Compounds
Raquel Rodríguez-Pérez ... Martin Vogt
Journal of Chemical Information and Modeling | VOL. 57
Raquel Rodríguez-Pérez, et. al.Raquel Rodríguez-Pérez ... Martin Vogt
10 Apr 2017
Journal of Chemical Information and Modeling | VOL. 57

Automatic Classification of Swedish Metadata Using Dewey Decimal Classification: A Comparison of Approaches
Koraljka Golub ... Johan Hagelbäck
Journal of Data and Information Science | VOL. 5
Koraljka Golub, et. al.Koraljka Golub ... Johan Hagelbäck
01 Feb 2020
Journal of Data and Information Science | VOL. 5

Understanding and formulation of various kernel techniques for suport vector machines
Prayashi Bohra ... Hemant Palivela
-
Prayashi Bohra, et. al.Prayashi Bohra ... Hemant Palivela
01 Dec 2015
01 Dec 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Boosting the margin: a new explanation for the effectiveness of voting methods

Abstract

Talk to us

Similar Papers

More From: The Annals of Statistics