Guiding feature subset selection with an interactive visualization

Thorsten May, Andreas Bannach, Tobias Ruppert, James Davey, Jörn Kohlhammer

doi:10.1109/vast.2011.6102448

Abstract

We propose a method for the semi-automated refinement of the results of feature subset selection algorithms. Feature subset selection is a preliminary step in data analysis which identifies the most useful subset of features (columns) in a data table. So-called filter techniques use statistical ranking measures for the correlation of features. Usually a measure is applied to all entities (rows) of a data table. However, the differing contributions of subsets of data entities are masked by statistical aggregation. Feature and entity subset selection are, thus, highly interdependent. Due to the difficulty in visualizing a high-dimensional data table, most feature subset selection algorithms are applied as a black box at the outset of an analysis. Our visualization technique, SmartStripes, allows users to step into the feature subset selection process. It enables the investigation of dependencies and interdependencies between different feature and entity subsets. A user may even choose to control the iterations manually, taking into account the ranking measures, the contributions of different entity subsets, as well as the semantics of the features.
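
The masking effect described in the abstract can be illustrated with a small sketch. It is not the SmartStripes method itself; it only shows, on a made-up table, how a filter-style ranking computed over all entities can differ from the same ranking computed over one entity subset. The ranking measure (absolute Pearson correlation with a target column), the column names `f1`, `f2`, `y`, `group`, and the helpers `pearson` and `rank_features` are all assumptions chosen for illustration; the abstract does not name a specific measure.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / sqrt(vx * vy)


def rank_features(rows, features, target):
    """Rank features by absolute correlation with the target, strongest first."""
    ys = [r[target] for r in rows]
    scores = {f: abs(pearson([r[f] for r in rows], ys)) for f in features}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical toy table: f1 tracks y in group A and opposes it in group B,
# so its contribution cancels out when all rows are aggregated.
rows = [
    {"group": "A", "f1": 1, "f2": 2, "y": 1},
    {"group": "A", "f1": 2, "f2": 1, "y": 2},
    {"group": "A", "f1": 3, "f2": 3, "y": 3},
    {"group": "A", "f1": 4, "f2": 3, "y": 4},
    {"group": "B", "f1": 1, "f2": 3, "y": 4},
    {"group": "B", "f1": 2, "f2": 3, "y": 3},
    {"group": "B", "f1": 3, "f2": 1, "y": 2},
    {"group": "B", "f1": 4, "f2": 2, "y": 1},
]
features = ["f1", "f2"]

# Ranking over all entities versus ranking over one entity subset.
ranking_all = rank_features(rows, features, "y")
ranking_a = rank_features([r for r in rows if r["group"] == "A"], features, "y")

print("all rows:", ranking_all)
print("group A :", ranking_a)
```

In this toy data the two rankings disagree about which feature comes first, which is the kind of interdependence between feature subsets and entity subsets that the abstract says an interactive view is meant to expose.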
