Feature subset selection based on relevance

Hui Wang, David Bell, Fionn Murtagh

doi:10.1016/s0083-6656(97)00043-3

Abstract

This paper presents an axiomatic characterisation of feature subset selection. Two axioms are given: the sufficiency axiom (preservation of learning information) and the necessity axiom (minimising encoding length). The sufficiency axiom concerns the existing dataset and rests on the following understanding: any selected feature subset should be able to describe the training dataset without losing information, i.e. it is consistent with the training dataset. The necessity axiom concerns predictability and is derived from Occam's razor, which states that the simplest among different alternatives is preferred for prediction. The two axioms are then restated concisely in terms of relevance: maximising both the r(X; Y) and r(Y; X) relevance. Based on this relevance characterisation, four feature subset selection algorithms are presented and analysed: one is exhaustive and the remaining three are heuristic. Experimental results are also presented and are encouraging. The algorithms are compared with some well-known feature subset selection algorithms, in particular with the built-in feature selection mechanism in C4.5.
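
The abstract does not define r(X; Y), so the following is only an illustrative sketch of how an exhaustive search driven by the two axioms might look. It assumes that relevance is mutual information normalised by entropy, r(X; Y) = I(X; Y) / H(Y), so that r(X; Y) = 1 means the selected features X fully determine the class Y on the training data (sufficiency), while a high r(Y; X) = I(X; Y) / H(X) favours subsets that carry little information beyond the class (necessity). The function names `entropy`, `relevance`, `project` and `exhaustive_select` are hypothetical, and this is not claimed to be the authors' algorithm.

```python
from collections import Counter
from itertools import combinations
from math import log2


def entropy(values):
    """Shannon entropy (in bits) of a sequence of hashable values."""
    n = len(values)
    return -sum((c / n) * log2(c / n) for c in Counter(values).values())


def relevance(xs, ys):
    """Assumed definition: r(X; Y) = I(X; Y) / H(Y); taken as 1.0 when H(Y) == 0."""
    h_y = entropy(ys)
    if h_y == 0:
        return 1.0
    mutual_info = entropy(xs) + h_y - entropy(list(zip(xs, ys)))
    return mutual_info / h_y


def project(rows, subset):
    """Restrict every row to the feature indices in `subset`."""
    return [tuple(row[i] for i in subset) for row in rows]


def exhaustive_select(rows, labels, tol=1e-9):
    """Among non-empty subsets with r(X; Y) ~ 1, return one maximising r(Y; X).

    Subsets are visited from smallest to largest, and ties keep the
    earlier (smaller) subset.
    """
    n_features = len(rows[0])
    best, best_score = None, -1.0
    for k in range(1, n_features + 1):
        for subset in combinations(range(n_features), k):
            xs = project(rows, subset)
            if relevance(xs, labels) < 1 - tol:
                continue  # fails the assumed sufficiency test
            score = relevance(labels, xs)  # r(Y; X) = I(X; Y) / H(X)
            if score > best_score + tol:
                best, best_score = subset, score
    return best, best_score
```

The search enumerates every non-empty subset, so its cost grows exponentially with the number of features; this is presumably why the paper also offers heuristic alternatives.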

Journal: Vistas in Astronomy
Publication Date: Jan 1, 1997
Citations: 9
