Abstract

Support vector machines (SVMs) are well-known classifiers owing to their superior classification performance. An SVM is defined by a hyperplane that separates two classes with the largest margin. Computing this hyperplane, however, requires solving a quadratic programming problem whose storage cost grows with the square of the number of training samples and whose time complexity is, in general, proportional to the cube of that number. It is therefore worth studying how to reduce the training time of SVMs without compromising performance, so that SVMs remain practical for large-scale problems. In this paper, we propose a novel data reduction method that shortens training time by combining decision trees with a new concept, the relative support distance, which is used to select good support vector candidates within each partition generated by the decision trees. The selected support vector candidates improve the training speed for large-scale SVM problems. In experiments, we demonstrate that our approach significantly reduces training time while maintaining good classification performance compared with existing approaches.
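A minimal sketch of this idea is shown below, assuming scikit-learn is available. The exact relative support distance is defined in the paper; here it is only approximated by the distance to the nearest opposite-class point inside each tree partition, so this is an illustration of the pipeline rather than the authors' implementation.

```python
# Hedged sketch of the reduction pipeline (assumptions: scikit-learn is used,
# and "relative support distance" is approximated by the distance to the
# nearest opposite-class point inside each decision-tree partition).
import numpy as np
from sklearn.tree import DecisionTreeClassifier
from sklearn.svm import SVC

def reduce_and_train(X, y, keep_ratio=0.2, max_leaf_nodes=32):
    # 1. Partition the input space with a shallow decision tree.
    tree = DecisionTreeClassifier(max_leaf_nodes=max_leaf_nodes).fit(X, y)
    leaves = tree.apply(X)                       # leaf index of every sample

    keep = []
    for leaf in np.unique(leaves):
        idx = np.where(leaves == leaf)[0]
        Xp, yp = X[idx], y[idx]
        if len(np.unique(yp)) < 2:               # pure leaf: far from the boundary
            continue
        # 2. Score each point by its distance to the nearest opposite-class
        #    point in the partition (stand-in for the relative support distance).
        scores = np.array([
            np.min(np.linalg.norm(Xp[yp != yp[i]] - Xp[i], axis=1))
            for i in range(len(idx))
        ])
        n_keep = max(2, int(keep_ratio * len(idx)))
        keep.extend(idx[np.argsort(scores)[:n_keep]])  # keep boundary-near points

    keep = np.array(keep) if keep else np.arange(len(X))  # fallback: keep all
    # 3. Train the SVM only on the selected support vector candidates.
    return SVC(kernel="rbf").fit(X[keep], y[keep])
```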

Highlights

  • Support vector machines (SVMs) [1] are a powerful machine learning algorithm for classification problems that works by recognizing patterns via the kernel trick [2]

  • Nonlinear separation is achieved via kernel functions, which map the input space to a high-dimensional space, called the feature space, in which the optimal separating hyperplane is determined

  • In order to cope with large-scale SVM problems, we propose a novel selection method for support vector candidates using a combination of tree decomposition and relative support distance


Summary

Introduction

Support vector machines (SVMs) [1] have been a very powerful machine learning algorithm developed for classification problems, which works by recognizing patterns via the kernel trick [2]. Because of their high performance and strong generalization ability compared with other classification methods, SVMs are widely used in bioinformatics, text and image recognition, and finance, to name a few areas. The method finds a linear boundary (hyperplane) that yields the largest margin between two classes (labels) in the input space [3,4,5,6]. It can perform both linear and nonlinear separation: nonlinear separation is achieved via kernel functions, which map the input space to a high-dimensional space, called the feature space, in which the optimal separating hyperplane is determined. The hyperplane in the feature space, which achieves a better separation of the training data, translates into a nonlinear boundary in the original space [7,8]. The kernel trick is used to associate the kernel function with the mapping function, bringing forth a nonlinear separation in the input space.
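For concreteness, the following small illustration (assuming scikit-learn; it is not part of the paper's method) shows how a kernel turns a linearly inseparable problem into a separable one:

```python
# Illustration of the kernel trick (assumes scikit-learn): a linearly
# inseparable data set becomes separable once an RBF kernel implicitly
# maps the inputs to a high-dimensional feature space.
from sklearn.datasets import make_circles
from sklearn.svm import SVC

X, y = make_circles(n_samples=400, factor=0.4, noise=0.05, random_state=0)

linear_svm = SVC(kernel="linear").fit(X, y)       # hyperplane in the input space
rbf_svm = SVC(kernel="rbf", gamma=2.0).fit(X, y)  # hyperplane in the feature space

print("linear kernel accuracy:", linear_svm.score(X, y))  # roughly 0.5: no linear boundary exists
print("RBF kernel accuracy:", rbf_svm.score(X, y))        # close to 1.0: nonlinear boundary found
```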

