Abstract
Classification of High Dimension Low Sample Size (HDLSS) datasets is a challenging task in supervised learning. Such datasets are prevalent in many areas, including biomedical applications and business analytics. In this paper, a new embedded feature selection method for HDLSS datasets is introduced by incorporating sparsity into Proximal Support Vector Machines (PSVMs). Our method, called Sparse Proximal Support Vector Machines (sPSVMs), learns a sparse representation of PSVMs by first casting the PSVM as an equivalent least squares problem and then introducing an l1-norm penalty for sparsity. An efficient algorithm based on alternating optimization techniques is proposed. sPSVMs remove more than 98% of the features in many high dimensional datasets without compromising generalization performance. The stability of the feature selection process of sPSVMs is also studied and compared with that of univariate filter techniques. Additionally, sPSVMs offer the advantage of interpreting the selected features in the context of the classes, by inducing class-specific local sparsity rather than the global sparsity of other embedded methods. sPSVMs appear to be robust with respect to data dimensionality. Moreover, sPSVMs perform feature selection and classification in a single step, eliminating the need for a separate dimensionality reduction stage. To that end, sPSVMs can be used for preprocessing-free classification tasks.