Prediction of diseases is sensitive as any error can result in the wrong person's treatment or not treating the right patient. Besides, some features distinguish a disease from curable to fatal or curable to chronic disease. Data mining techniques have been widely used in health-related research. The researchers, so far, could attain around 97 percent accuracy using several methods. Some researchers have demonstrated that the selection of correct features increases the prediction accuracy. This research work propose a method to distinguish between chronic and non-chronic kidney disease, identify its crucial features without reducing the accuracy of prediction, and a prediction algorithm to eliminate the possibility of under or overfitting. This study uses the recursive feature elimination (RFE) method that selects an optimal subset of features and an ensemble algorithm, the enhanced decision tree (EDT), to predict the disease. The results obtained in this paper show that the accuracy level of EDT is not changed with the removal of less significant features, thus enabling the decision-makers to concentrate on few features to reduce time and error of treatment. EDT establishes substantially high consistency in predicting, with or without feature selection, the disease.
Read full abstract