A probabilistic classifier ensemble weighting scheme based on cross-validated accuracy estimates

James Large,Anthony Bagnall,Jason Lines

doi:10.1007/s10618-019-00638-y

James Large, Anthony Bagnall + Show 1 more

Open Access

https://doi.org/10.1007/s10618-019-00638-y

Copy DOI

Journal: Data Mining and Knowledge Discovery	Publication Date: Jun 17, 2019
Citations: 73	License type: open-access

Affiliation: University of East Anglia

Abstract

Our hypothesis is that building ensembles of small sets of strong classifiers constructed with different learning algorithms is, on average, the best approach to classification for real-world problems. We propose a simple mechanism for building small heterogeneous ensembles based on exponentially weighting the probability estimates of the base classifiers with an estimate of the accuracy formed through cross-validation on the train data. We demonstrate through extensive experimentation that, given the same small set of base classifiers, this method has measurable benefits over commonly used alternative weighting, selection or meta-classifier approaches to heterogeneous ensembles. We also show how an ensemble of five well-known, fast classifiers can produce an ensemble that is not significantly worse than large homogeneous ensembles and tuned individual classifiers on datasets from the UCI archive. We provide evidence that the performance of the cross-validation accuracy weighted probabilistic ensemble (CAWPE) generalises to a completely separate set of datasets, the UCR time series classification archive, and we also demonstrate that our ensemble technique can significantly improve the state-of-the-art classifier for this problem domain. We investigate the performance in more detail, and find that the improvement is most marked in problems with smaller train sets. We perform a sensitivity analysis and an ablation study to demonstrate the robustness of the ensemble and the significant contribution of each design element of the classifier. We conclude that it is, on average, better to ensemble strong classifiers with a weighting scheme rather than perform extensive tuning and that CAWPE is a sensible starting point for combining classifiers.

Highlights

Investigation into the properties and characteristics of classification algorithms forms a significant component of all research in machine learning
Data mining is an intrinsically practical exercise and our interest is in answering the following question: if we have a new classification problem or set of problems, what family of models should we use given our computational constraints? This interest has arisen from our work in the domain of time series classification (Bagnall et al 2017) and through working with many industrial partners, but we cannot find an acceptable answer in the literature
Using data derived from the UCI archive, we find that a small ensemble of five untuned simple classifiers combined using cross-validation accuracy weighted probabilistic ensemble (CAWPE) is not significantly worse than either state-of-the-art untuned homogeneous ensembles, nor tuned random forest, support vector machine, multilayer perceptron and gradient boosting classifiers

Summary

Introduction

Investigation into the properties and characteristics of classification algorithms forms a significant component of all research in machine learning. Data mining is an intrinsically practical exercise and our interest is in answering the following question: if we have a new classification problem or set of problems, what family of models should we use given our computational constraints? A dataset D of size n is a set of attribute vectors with an associated observation of a class variable (the response), D = {(x1, y1), . A learning algorithm L, takes a training dataset Dr and constructs a classifier or model M. The final model M produced by L by training on Dr is evaluated on a test dataset De. A classifier M is a mapping from the space of possible attribute vectors to the space of possible probability distributions over the c valid values of the class variable, M(x) = p, where p = { p(y = 1|M, x), . Given p , the estimate of the response is the value with the maximum probability

Objectives

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A probabilistic classifier ensemble weighting scheme based on cross-validated accuracy estimates

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Data Mining and Knowledge Discovery

Lead the way for us

Similar Papers

Ensemble-Based Online Machine Learning Algorithms for Network Intrusion Detection Systems Using Streaming Data
Nathan Martindale ... Muhammad Ismail
Information | VOL. 11
Nathan Martindale, et. al.Nathan Martindale ... Muhammad Ismail
11 Jun 2020
Information | VOL. 11

Progressive Heterogeneous Ensemble Learning for Cancer Gene Expression Classification
Vishal Kore ... Vijay Khadse
-
Vishal Kore, et. al.Vishal Kore ... Vijay Khadse
01 Aug 2022
01 Aug 2022

Predicting student academic performance using multi-model heterogeneous ensemble approach
Olugbenga Wilson Adejo ... Thomas Connolly
Journal of Applied Research in Higher Education | VOL. 10
Olugbenga Wilson Adejo, et. al.Olugbenga Wilson Adejo ... Thomas Connolly
21 Dec 2017
Journal of Applied Research in Higher Education | VOL. 10

One tree to explain them all
Ulf Johansson ... Cecilia Sonstrod
-
Ulf Johansson, et. al.Ulf Johansson ... Cecilia Sonstrod
01 Jun 2011
01 Jun 2011

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A probabilistic classifier ensemble weighting scheme based on cross-validated accuracy estimates

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Data Mining and Knowledge Discovery