The Boosting and Bootstrap Ensembles for the Pair Classifier Based on the Dual Indiscernibility Matrix

Piotr Artiemjew,Lech Polkowski,Przemyslaw Gorecki,Bartosz Nowak

doi:10.1007/978-3-319-54966-8_20

Abstract

In search for subsets of data sets relevant in the classification of data tasks, we have exploited the betweenness relation adopted from axiomatic elementary geometry. It has turned out that this relation can partition data into two subsets, one of them dense in the sense that each object in this set is a convex combination of a finite number of objects in it, the other, to the contrary, consisting of objects endowed with outliers, i.e., pairs of (attribute, value) not possessed by any other object. A technical tool for singling out those subsets is the Dual Indiscernibility Matrix defined as a counterpart to a well-known Discernibility Matrix of Skowron-Rauszer. On the basis of those ideas, the pair Classifier has been introduced. It is its main feature that test objects are approximated to a certain degree by pairs of training objects which are not required to cover the object completely. In this chapter, we examine selected methods for stabilization of the Pair Classifier like Bootstrap Ensemble, Arcing based Bootstrap, Ada–Boost with Monte Carlo split. We present results of experiments with some standard data sets. Consecutive sections are dedicated to basics of the method: Dual Indiscernibility Matrix, Kernel and residuum in data sets, Pair Classifier, Experiments, Discussion of results, Conclusion and Perspectives.

Full Text