Efficient and decision boundary aware instance selection for support vector machines

Mohammad Aslani,Stefan Seipel

doi:10.1016/j.ins.2021.07.015

Mohammad Aslani, Stefan Seipel

Open Access

https://doi.org/10.1016/j.ins.2021.07.015

Copy DOI

Journal: Information Sciences	Publication Date: Jul 6, 2021
Citations: 24	License type: cc-by

Affiliation: University of Gävle, Uppsala University

Abstract

Support vector machines (SVMs) are powerful classifiers that have high computational complexity in the training phase, which can limit their applicability to large datasets. An effective approach to address this limitation is to select a small subset of the most representative training samples such that desirable results can be obtained. In this study, a novel instance selection method called border point extraction based on locality-sensitive hashing (BPLSH) is designed. BPLSH preserves instances that are near the decision boundaries and eliminates nonessential ones. The performance of BPLSH is benchmarked against four approaches on different classification problems. The experimental results indicate that BPLSH outperforms the other methods in terms of classification accuracy, preservation rate, and execution time. The source code of BPLSH can be found in https://github.com/mohaslani/BPLSH.

Highlights

Support vector machines (SVMs) are effective classifiers with a definite theoretical foundation and have been extensively used in various applications in different fields, such as data mining [42], remote sensing [32], and geoscience [40]
For an unbiased and reliable evaluation of the instance selection methods, a repeated stratified q-fold cross-validation scheme is used
A given dataset is partitioned into q exclusive folds, and each time, q-1 folds are utilized to train the SVM after an instance selection method is applied to them

Summary

Introduction

Support vector machines (SVMs) are effective classifiers with a definite theoretical foundation and have been extensively used in various applications in different fields, such as data mining [42], remote sensing [32], and geoscience [40]. SVMs come with a minimal structural risk because they search for a separating hyperplane that represents the maximum margin between classes This feature makes SVMs more effective than other classifiers. Training an SVM, in which support vectors are oÀbtÁained, requires solving a quadratic programming optimization problem, which poses a computational complexity of O n3 , where n is the number of training samples This computational cost inhibits the applicability of SVMs to tasks involving large datasets, such as feature extraction from high-resolution aerial images. The instances with great potential to contribute to the classification and construction of demarcation hyperplanes are preserved These patterns, called support vector candidates, lie close to the border of classes, and they have been

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Efficient and decision boundary aware instance selection for support vector machines

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Information Sciences

Lead the way for us

Similar Papers

An Evolutionary Multiobjective Model and Instance Selection for Support Vector Machines With Pareto-Based Ensembles
Alejandro Rosales-Pérez ... Salvador García
IEEE Transactions on Evolutionary Computation | VOL. 21
Alejandro Rosales-Pérez, et. al.Alejandro Rosales-Pérez ... Salvador García
01 Dec 2017
IEEE Transactions on Evolutionary Computation | VOL. 21

Differential evolution-based parameters optimisation and feature selection for support vector machine
Jun Li ... Lixin Ding
International Journal of Computational Science and Engineering | VOL. 13
Jun Li, et. al.Jun Li ... Lixin Ding
01 Jan 2015
International Journal of Computational Science and Engineering | VOL. 13

Nonparallel hyperplane classifiers for multi-category classification
Pooja Saigal ... Reshma Khemchandani
-
Pooja Saigal, et. al.Pooja Saigal ... Reshma Khemchandani
01 Dec 2015
01 Dec 2015

A multi-objective evolutionary approach to training set selection for support vector machine
Giovanni Acampora ... Autilia Vitiello
Knowledge-Based Systems | VOL. 147
Giovanni Acampora, et. al.Giovanni Acampora ... Autilia Vitiello
13 Feb 2018
Knowledge-Based Systems | VOL. 147

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient and decision boundary aware instance selection for support vector machines

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Information Sciences