Abstract

Chronic kidney disease (CKD) is a progressive condition characterized by the gradual deterioration of kidney functions, potentially leading to kidney failure if not promptly diagnosed and treated. Machine learning (ML) algorithms have shown significant promise in disease diagnosis, but in healthcare, clinical data pose challenges: missing values, noisy inputs, and redundant features, affecting early-stage CKD prediction. Thus, this study presents a novel, fully automated machine learning approach to tackle these complexities by incorporating feature selection (FS) and feature space reduction (FSR) techniques, leading to a substantial enhancement of the model’s performance. A data balancing technique is also employed during preprocessing to address data imbalance issue that is commonly encountered in clinical contexts. Finally, for reliable CKD classification, an ensemble characteristics-based classifier is encouraged. The effectiveness of our approach is rigorously validated and assessed on multiple datasets, and the clinical relevancy of the strategy is evaluated on the real-world therapeutic data collected from Bangladeshi patients. The study establishes the dominance of adaptive boosting, logistic regression, and passive aggressive ML classifiers with 96.48% accuracy in forecasting unseen therapeutic CKD data, particularly in early-stage cases. Furthermore, the effectiveness of the FSR technique in reducing the prediction time significantly is revealed. The outstanding performance of the proposed model demonstrates its effectiveness in addressing the complexity of healthcare CKD data by incorporating the FS and FSR techniques. This highlights its potential as a promising computer-aided diagnosis tool for doctors, enabling early interventions and improving patient outcomes.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call