R-HEFS: Rough set based heterogeneous ensemble feature selection method for medical data classification

Rubul Kumar Bania,Anindya Halder

doi:10.1016/j.artmed.2021.102049

Abstract

Feature selection is one of the trustworthy processes of dimensionality reduction technique to select a subset of relevant and non-redundant features from large datasets. Ensemble feature selection (EFS) approach is a recent technique aiming at accumulating diversity in the subset of selected features. It improves the performance of learning algorithms and obtains more stable and robust results. In this paper, a novel rough set theory (RST) based heterogeneous EFS method (R-HEFS) is proposed for selecting the less redundant and highly relevant features during the aggregation of diverse feature subsets by applying the feature-class, feature-feature rough dependency and feature-significance measures. In R-HEFS five state-of-the-art RST based filter methods are used as a base feature selectors. Experiments are carried out on 10 benchmark medical datasets collected from the UCI repository. For the imputation of the missing values and discretization of the continuous features, k nearest neighbor (kNN) imputation method and RST based discretization techniques are applied. The effectiveness of the proposed R-HEFS method is evaluated and analyzed by using four benchmark classifiers viz., Naïve Bayes (NB), random forest (RF), support vector machine (SVM), and AdaBoost. The proposed R-HEFS method turns out to be effective by removing the non-relevant and redundant features during the process of aggregation of base feature selectors and it assists to increase the classification accuracy. Out of 10 different medical datasets, on 7 datasets, R-HEFS has achieved better average classification accuracy. So, the overall results strongly suggest that the proposed R-HEFS method can reduce the dimension of large medical datasets and may help the physicians or medical experts to diagnose (classify) different diseases with lesser computational complexities.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

R-HEFS: Rough set based heterogeneous ensemble feature selection method for medical data classification

Abstract

Talk to us

Similar Papers

More From: Artificial Intelligence in Medicine

Lead the way for us

Journal: Artificial Intelligence in Medicine	Publication Date: Mar 6, 2021
Citations: 38

Similar Papers

R-Ensembler: A greedy rough set based ensemble attribute selection algorithm with kNN imputation for classification of medical data
Rubul Kumar Bania ... Anindya Halder
Computer Methods and Programs in Biomedicine | VOL. 184
Rubul Kumar Bania, et. al.Rubul Kumar Bania ... Anindya Halder
08 Oct 2019
Computer Methods and Programs in Biomedicine | VOL. 184

Feature selection and its combination with data over-sampling for multi-class imbalanced datasets
Chih-Fong Tsai ... Wei-Chao Lin
Applied Soft Computing | VOL. 153
Chih-Fong Tsai, et. al.Chih-Fong Tsai ... Wei-Chao Lin
17 Jan 2024
Applied Soft Computing | VOL. 153

A novel hybrid credit scoring model based on ensemble feature selection and multilayer ensemble classification
Diwakar Tripathi ... Venkatanareshbabu Kuppili
Computational Intelligence | VOL. 35
Diwakar Tripathi, et. al.Diwakar Tripathi ... Venkatanareshbabu Kuppili
07 Mar 2019
Computational Intelligence | VOL. 35

Variable selection strategies for nearest neighbor imputation methods used in remote sensing based forest inventory
Petteri Packalén ... Matti Maltamo
Canadian Journal of Remote Sensing | VOL. 38
Petteri Packalén, et. al.Petteri Packalén ... Matti Maltamo
20 Nov 2012
Canadian Journal of Remote Sensing | VOL. 38

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

R-HEFS: Rough set based heterogeneous ensemble feature selection method for medical data classification

Abstract

Talk to us

Similar Papers

More From: Artificial Intelligence in Medicine