A novel filter–wrapper hybrid greedy ensemble approach optimized using the genetic algorithm to reduce the dimensionality of high-dimensional biomedical datasets

Tushaar Gangavarapu,Nagamma Patil

doi:10.1016/j.asoc.2019.105538

Abstract

The predictive accuracy of high-dimensional biomedical datasets is often dwindled by many irrelevant and redundant molecular disease diagnosis features. Dimensionality reduction aims at finding a feature subspace that preserves the predictive accuracy while eliminating noise and curtailing the high computational cost of training. The applicability of a particular feature selection technique is heavily reliant on the ability of that technique to match the problem structure and to capture the inherent patterns in the data. In this paper, we propose a novel filter–wrapper hybrid ensemble feature selection approach based on the weighted occurrence frequency and the penalty scheme, to obtain the most discriminative and instructive feature subspace. The proposed approach engenders an optimal feature subspace by greedily combining the feature subspaces obtained from various predetermined base feature selection techniques. Furthermore, the base feature subspaces are penalized based on specific performance dependent penalty parameters. We leverage effective heuristic search strategies including the greedy parameter-wise optimization and the Genetic Algorithm (GA) to optimize the subspace ensembling process. The effectiveness, robustness, and flexibility of the proposed hybrid greedy ensemble approach in comparison with the base feature selection techniques, and prolific filter and state-of-the-art wrapper methods are justified by empirical analysis on three distinct high-dimensional biomedical datasets. Experimental validation revealed that the proposed greedy approach, when optimized using GA, outperformed the selected base feature selection techniques by 4.17%–15.14% in terms of the prediction accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A novel filter–wrapper hybrid greedy ensemble approach optimized using the genetic algorithm to reduce the dimensionality of high-dimensional biomedical datasets

Abstract

Talk to us

Similar Papers

More From: Applied Soft Computing

Lead the way for us

Journal: Applied Soft Computing	Publication Date: Jun 3, 2019
Citations: 37

Similar Papers

Feature Selection for Optimized High-Dimensional Biomedical Data Using an Improved Shuffled Frog Leaping Algorithm
Bin Hu ... Xiaowei Zhang
IEEE/ACM Transactions on Computational Biology and Bioinformatics | VOL. 15
Bin Hu, et. al.Bin Hu ... Xiaowei Zhang
24 Aug 2016
IEEE/ACM Transactions on Computational Biology and Bioinformatics | VOL. 15

A Review of Feature Selection Techniques in Sentiment Analysis Using Filter, Wrapper, or Hybrid Methods
Pulung Hendro Prastyo ... Igi Ardiyanto
-
Pulung Hendro Prastyo, et. al.Pulung Hendro Prastyo ... Igi Ardiyanto
07 Sep 2020
07 Sep 2020

Evolutionary Multitask Ensemble Learning Model for Hyperspectral Image Classification
Jiao Shi ... Yu Lei
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 14
Jiao Shi, et. al.Jiao Shi ... Yu Lei
12 Nov 2020
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 14

An ensemble method of feature selection and classification of Odia characters
Mamatarani Das ... Mrutyunjaya Panda
-
Mamatarani Das, et. al.Mamatarani Das ... Mrutyunjaya Panda
08 Jan 2021
08 Jan 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A novel filter–wrapper hybrid greedy ensemble approach optimized using the genetic algorithm to reduce the dimensionality of high-dimensional biomedical datasets

Abstract

Talk to us

Similar Papers

More From: Applied Soft Computing