Comparative Analysis of ML Models with Selection Methods for Early Predictive Analytics of Sepsis in ICU

Vandana Vandana,Rita Chhikara

doi:10.17485/ijst/v17i41.1925

Abstract

Objectives: This study aims to make early prediction of Sepsis using ML algorithms and provide comparative analysis of different feature selection techniques. Methods: In this study, the Physionet website dataset has been used. Sepsis categorization is done on the basis of Sepsis-3 criteria. The dataset used is highly imbalanced, with only 2% of the data belonging to Sepsis. The SmoteTomek technique is used to handle an imbalanced dataset. Various filter, embedded, and wrapper feature selection techniques, like tree-based feature selection technique, recursive feature elimination (RFE), information gain, Bhattacharya distance, lasso, etc., have been applied for the top-performing classification ML models. These selection techniques were applied to RandomForest, K Nearest Neighbors (KNN), and Decision Tree models. We compared the impact of these selection techniques on the aforesaid machine learning models. Findings: After applying the RFE technique, the Area Under the Receiver Operating Characteristic curve (AUROC) score of the RandomForest model has slightly increased from 0.996 to 0.9974. KNN model with a tree-based feature selection technique showed the highest sensitivity of 0.934. which is slightly higher than the sensitivity of 0.922, which was without applying any feature selection technique. Along with the AUROC score, the highest performance, in terms of specificity (0.9976), accuracy (0.9959), and f-measure score (0.9062), is achieved when RFE is applied to the RandomForest model. The best selection algorithm for decision trees and KNN is the tree-based selection technique. RFE is the best selection technique for RandomForest. Novelty: In this research, the AUROC score is slightly increased to 0.9974, which has not been achieved yet. Instead of 40, the number of features chosen is 20. This research also provides a comparison of different feature selection techniques like tree-based feature selection, information gain, Bhattacharya, and RFE. It also analyses their impact on the performance of models, which has not been done yet with the same set of selection techniques. Keywords: Sepsis, Machine Learning, Feature selection, Early prediction, Predictive Analytics

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Comparative Analysis of ML Models with Selection Methods for Early Predictive Analytics of Sepsis in ICU

Abstract

Talk to us

Similar Papers

More From: Indian Journal Of Science And Technology

Lead the way for us

Similar Papers

-
-
Indian Journal Of Science And Technology | VOL. 17
--
18 Nov 2024
Indian Journal Of Science And Technology | VOL. 17

Comparative Analysis of ML Models with Selection Methods for Early Predictive Analytics of Sepsis in ICU
Vandana Vandana ... Rita Chhikara
Indian Journal Of Science And Technology | VOL. 17
Vandana Vandana, et. al.Vandana Vandana ... Rita Chhikara
18 Nov 2024
Indian Journal Of Science And Technology | VOL. 17

Classification of Liver Lesion Stages using pyRadiomics Features Combined with 3D-CNN in 3D-CT and US Images
A Bathsheba Parimala ... R S Shanmugasundaram
Indian Journal Of Science And Technology | VOL. 17
A Bathsheba Parimala, et. al.A Bathsheba Parimala ... R S Shanmugasundaram
18 Nov 2024
Indian Journal Of Science And Technology | VOL. 17

Review of Electrical Steels with their Properties and Recent Trends for Improvement
Asif Momin
Indian Journal Of Science And Technology | VOL. 17
Asif MominAsif Momin
18 Nov 2024
Indian Journal Of Science And Technology | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comparative Analysis of ML Models with Selection Methods for Early Predictive Analytics of Sepsis in ICU

Abstract

Talk to us

Similar Papers

More From: Indian Journal Of Science And Technology