Abstract
In this work we address the problem of feature selection for the classification task in hierarchical and sparse feature spaces, which characterize many real-world applications nowadays. A binary feature space is deemed hierarchical when its binary features are related via generalization-specialization relationships, and is considered sparse when instances generally contain far fewer "positive" than "negative" feature values. In any given instance, a feature value is deemed positive (negative) when the property associated with the feature has been (has not been) observed for that instance. Although there are many methods for the traditional feature selection problem in the literature, the proper treatment of hierarchical feature structures remains a challenge. Hence, we introduce a novel hierarchical feature selection method that follows the lazy learning paradigm—selecting a feature subset tailored for each instance in the test set. Our strategy prioritizes the selection of features with positive values, since they tend to be more informative—the presence of a relatively rare property usually carries more relevant information than the absence of that property. Experiments on different application domains have shown that the proposed method outperforms previous hierarchical feature selection methods, as well as traditional methods, in terms of predictive accuracy, while in general selecting smaller feature subsets.
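To make the core idea concrete, the following is a toy sketch of lazy, per-instance selection of positive features in a hierarchical binary feature space. The hierarchy, the example instance, and the redundancy rule (dropping generalizations implied by a more specific positive feature) are illustrative assumptions, not the paper's exact algorithm.

```python
# Hierarchy: child -> parent (specialization -> generalization).
# This taxonomy is a made-up example.
PARENT = {
    "mammal": "animal",
    "dog": "mammal",
    "cat": "mammal",
    "plant": None,
    "animal": None,
}

def ancestors(feature):
    """All generalizations of a feature, following parent links."""
    out = []
    parent = PARENT.get(feature)
    while parent is not None:
        out.append(parent)
        parent = PARENT.get(parent)
    return out

def select_features(instance):
    """Lazily pick a feature subset for one test instance: keep
    positive-valued features, dropping ancestors that are hierarchically
    redundant (i.e. implied by a more specific positive feature)."""
    positives = {f for f, v in instance.items() if v == 1}
    redundant = set()
    for f in positives:
        redundant.update(a for a in ancestors(f) if a in positives)
    return sorted(positives - redundant)

# A sparse instance: mostly negative (0) feature values.
instance = {"animal": 1, "mammal": 1, "dog": 1, "cat": 0, "plant": 0}
print(select_features(instance))  # -> ['dog']
```

Because selection happens at prediction time for each test instance separately (the lazy paradigm), the retained subset can differ from instance to instance, unlike eager methods that fix one subset for the whole dataset.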