Finding potential support vectors in separable classification problems.

Damiano Varagnolo,Simone Del Favero,Gianluigi Pillonetto,Luca Schenato,Francesco Dinuzzo

doi:10.1109/tnnls.2013.2264731

Damiano Varagnolo, Simone Del Favero + Show 3 more

Open Access

PDF Available

https://doi.org/10.1109/tnnls.2013.2264731

Copy DOI

Export

Save

Cite

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

This paper considers the classification problem using support vector (SV) machines and investigates how to maximally reduce the size of the training set without losing information. Under separable data set assumptions, we derive the exact conditions stating which observations can be discarded without diminishing the overall information content. For this purpose, we introduce the concept of potential SVs, i.e., those data that can become SVs when future data become available. To complement this, we also characterize the set of discardable vectors (DVs), i.e., those data that, given the current data set, can never become SVs. Thus, these vectors are useless for future training purposes and can eventually be removed without loss of information. Then, we provide an efficient algorithm based on linear programming that returns the potential and DVs by constructing a simplex tableau. Finally, we compare it with alternative algorithms available in the literature on some synthetic data as well as on data sets from standard repositories.

Full Text