Feature Selection and Classification of Clinical Datasets Using Bioinspired Algorithms and Super Learner.

S Murugesan,S Keerthana Sankari,R S Bhuvaneswaran,H Khanna Nehemiah,Y Nancy Jane

doi:10.1155/2021/6662420

S Murugesan, S Keerthana Sankari + Show 3 more

Open Access

https://doi.org/10.1155/2021/6662420

Copy DOI

Abstract

A computer-aided diagnosis (CAD) system that employs a super learner to diagnose the presence or absence of a disease has been developed. Each clinical dataset is preprocessed and split into training set (60%) and testing set (40%). A wrapper approach that uses three bioinspired algorithms, namely, cat swarm optimization (CSO), krill herd (KH) ,and bacterial foraging optimization (BFO) with the classification accuracy of support vector machine (SVM) as the fitness function has been used for feature selection. The selected features of each bioinspired algorithm are stored in three separate databases. The features selected by each bioinspired algorithm are used to train three back propagation neural networks (BPNN) independently using the conjugate gradient algorithm (CGA). Classifier testing is performed by using the testing set on each trained classifier, and the diagnostic results obtained are used to evaluate the performance of each classifier. The classification results obtained for each instance of the testing set of the three classifiers and the class label associated with each instance of the testing set will be the candidate instances for training and testing the super learner. The training set comprises of 80% of the instances, and the testing set comprises of 20% of the instances. Experimentation has been carried out using seven clinical datasets from the University of California Irvine (UCI) machine learning repository. The super learner has achieved a classification accuracy of 96.83% for Wisconsin diagnostic breast cancer dataset (WDBC), 86.36% for Statlog heart disease dataset (SHD), 94.74% for hepatocellular carcinoma dataset (HCC), 90.48% for hepatitis dataset (HD), 81.82% for vertebral column dataset (VCD), 84% for Cleveland heart disease dataset (CHD), and 70% for Indian liver patient dataset (ILP).

Highlights

Data related to symptoms observed on a patient at a point of time are stored in electronic health records (EHRs)
Seven clinical datasets from the University of California Irvine (UCI) ML repository, namely, Wisconsin diagnostic breast cancer dataset (WDBC), Statlog heart disease dataset (SHD), hepatocellular carcinoma dataset (HCC), hepatitis dataset (HD), vertebral column dataset (VCD), Cleveland heart disease dataset (CHD), and Indian liver patient dataset (ILP) have been used for experimentation
The performance of the FCSO, FKH, and FBFO classifiers and super learner is evaluated in terms of accuracy, sensitivity, specificity, precision, and F -score, which are calculated based on true positive (TP), true negative (TN), false positive (FP), and false negative (FN) using Equations (22), (23), (24), (25), and (26)

Summary

Introduction

Data related to symptoms observed on a patient at a point of time are stored in electronic health records (EHRs). The filter method considers the dependency of each feature to the class label and is independent of any classification algorithm. Knowledge mining using rough sets for feature selection and backpropagation neural network (BPNN) for classifying clinical datasets has been proposed in [7]. A computer-aided diagnostic system that uses a neural network classifier trained using differential evolution, particle swarm optimization, and gradient descent backpropagation algorithms is proposed in [20]. A radial basis function neural network to classify clinical datasets using k-means clustering algorithm and quantum-behaved particle swarm optimization is proposed in [21]. A framework to classify unevenly spaced time series clinical data using improved double exponential smoothing, rough sets, neural network, and fuzzy logic is proposed in [23].

Literature Survey

System Framework

Results and Discussions

Conclusion and Scope for Future Work

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computational and mathematical methods in medicine	Publication Date: May 17, 2021
Citations: 22	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Feature Selection and Classification of Clinical Datasets Using Bioinspired Algorithms and Super Learner.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computational and mathematical methods in medicine

Lead the way for us

Similar Papers

Correlation-Based Ensemble Feature Selection Using Bioinspired Algorithms and Classification Using Backpropagation Neural Network.
V R Elgin Christo ... H Khanna Nehemiah
Computational and Mathematical Methods in Medicine | VOL. 2019
V R Elgin Christo, et. al.V R Elgin Christo ... H Khanna Nehemiah
23 Sep 2019
Computational and Mathematical Methods in Medicine | VOL. 2019

Classification Framework for Clinical Datasets Using Synergistic Firefly Optimization
V R Elgin Christo ... A Kannan
IETE Journal of Research | VOL. 69
V R Elgin Christo, et. al.V R Elgin Christo ... A Kannan
11 Dec 2021
IETE Journal of Research | VOL. 69

Hybrid approach using fuzzy sets and extreme learning machine for classifying clinical datasets
Kindie Biredagn Nahato ... A Kannan
Informatics in Medicine Unlocked | VOL. 2
Kindie Biredagn Nahato, et. al.Kindie Biredagn Nahato ... A Kannan
01 Jan 2015
Informatics in Medicine Unlocked | VOL. 2

Hybrid Ensemble Feature Selection for Heart Disease Prediction System Using an NMF Hierarchical Clustering
A.V Senthil Kumar
International Journal of Data Mining And Emerging Technologies | VOL. 5
A.V Senthil KumarA.V Senthil Kumar
01 Jan 2015
International Journal of Data Mining And Emerging Technologies | VOL. 5

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Feature Selection and Classification of Clinical Datasets Using Bioinspired Algorithms and Super Learner.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computational and mathematical methods in medicine