Abstract
Consider a supervised dataset D=[A∣b], where b is the outcome column, rows of D correspond to observations, and columns of A are the features of the dataset. A central problem in machine learning and pattern recognition is to select the most important features from D in order to predict the outcome. In this paper, we provide a new feature selection method in which we use perturbation theory to detect correlations between features. We solve AX=b using the method of least squares and the singular value decomposition of A. In practical applications, such as in bioinformatics, the number of rows of A (observations) is much smaller than the number of columns of A (features), so we are dealing with singular matrices that have very large condition numbers. Although it is known that solutions of least-squares problems in the singular case are very sensitive to perturbations in A, our novel approach in this paper is to prove that the correlations between features can be detected by applying perturbations to A. The effectiveness of our method is verified by performing a series of comparisons with conventional and recent feature selection methods in the literature. It is demonstrated that in most situations, our method selects considerably fewer features while matching or exceeding the accuracy of the other methods.
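To make the two ingredients named in the abstract concrete, the following is a minimal illustrative sketch: an SVD-based least-squares solve of AX=b and a column-wise perturbation of A whose effect on the solution is measured. The helper names, the random-perturbation scheme, and the sensitivity ranking are illustrative assumptions for this sketch only, not the authors' actual feature selection algorithm.

```python
import numpy as np

def least_squares_svd(A, b):
    """Minimum-norm least-squares solution of A x = b (SVD-based solver)."""
    x, *_ = np.linalg.lstsq(A, b, rcond=None)
    return x

def perturbation_sensitivity(A, b, eps=1e-3, seed=0):
    """Illustrative heuristic: score each feature (column of A) by how much
    the least-squares solution moves when that column is perturbed."""
    rng = np.random.default_rng(seed)
    x0 = least_squares_svd(A, b)
    scores = np.zeros(A.shape[1])
    for j in range(A.shape[1]):
        A_pert = A.copy()
        A_pert[:, j] += eps * rng.standard_normal(A.shape[0])  # perturb one feature
        scores[j] = np.linalg.norm(least_squares_svd(A_pert, b) - x0)
    return scores  # larger score = solution more sensitive to that feature

# Toy example with far fewer observations than features, as in bioinformatics
rng = np.random.default_rng(1)
A = rng.standard_normal((20, 100))                  # 20 observations, 100 features
b = A[:, :5] @ np.ones(5) + 0.01 * rng.standard_normal(20)
print(np.argsort(perturbation_sensitivity(A, b))[-10:])  # 10 most sensitive features
```

In this under-determined setting the matrix A is rank-deficient, which is precisely the regime the abstract describes: the least-squares solution is highly sensitive to perturbations, and that sensitivity is what the perturbation-based analysis exploits.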