Abstract

Chronic diseases represent a serious threat to public health across the world. It is estimated at about 60% of all deaths worldwide and approximately 43% of the global burden of chronic diseases. Thus, the analysis of the healthcare data has helped health officials, patients, and healthcare communities to perform early detection for those diseases. Extracting the patterns from healthcare data has helped the healthcare communities to obtain complete medical data for the purpose of diagnosis. The objective of the present research work is presented to improve the surveillance detection system for chronic diseases, which is used for the protection of people's lives. For this purpose, the proposed system has been developed to enhance the detection of chronic disease by using machine learning algorithms. The standard data related to chronic diseases have been collected from various worldwide resources. In healthcare data, special chronic diseases include ambiguous objects of the class. Therefore, the presence of ambiguous objects shows the availability of traits involving two or more classes, which reduces the accuracy of the machine learning algorithms. The novelty of the current research work lies in the assumption that demonstrates the noncrisp Rough K-means (RKM) clustering for figuring out the ambiguity in chronic disease dataset to improve the performance of the system. The RKM algorithm has clustered data into two sets, namely, the upper approximation and lower approximation. The objects belonging to the upper approximation are favourable objects, whereas the ones belonging to the lower approximation are excluded and identified as ambiguous. These ambiguous objects have been excluded to improve the machine learning algorithms. The machine learning algorithms, namely, naïve Bayes (NB), support vector machine (SVM), K-nearest neighbors (KNN), and random forest tree, are presented and compared. The chronic disease data are obtained from the machine learning repository and Kaggle to test and evaluate the proposed model. The experimental results demonstrate that the proposed system is successfully employed for the diagnosis of chronic diseases. The proposed model achieved the best results with naive Bayes with RKM for the classification of diabetic disease (80.55%), whereas SVM with RKM for the classification of kidney disease achieved 100% and SVM with RKM for the classification of cancer disease achieved 97.53 with respect to accuracy metric. The performance measures, such as accuracy, sensitivity, specificity, precision, and F-score, are employed to evaluate the performance of the proposed system. Furthermore, evaluation and comparison of the proposed system with the existing machine learning algorithms are presented. Finally, the proposed system has enhanced the performance of machine learning algorithms.

Highlights

  • Chronic diseases are serious diseases because they pose a serious threat to people’s lives and persist over long periods.ey can impede the freedom and health of people who have physical disabilities. us, they further cause frustration of people who suffer from various health disabilities. e available vaccines and medicine cannot completely prevent chronic diseases because they show no indications in any case

  • The Rough K-means (RKM) algorithm is considered to handle these ambiguous objects so that the accuracy of the classification algorithms can be improved. e RKM algorithm is appropriately designed for detecting the ambiguity in the chronic disease datasets. e experimental results have shown that the performance of the proposed system is better than that of the conventional models

  • It is investigated that there are ambiguous objects that hinder the classification algorithms. e diabetes data contain seven instances and two classes. ese ambiguous objects are examined by RKM clustering to assist in determining the exact class of ambiguous diseases or the closest one. e dataset has been clustered for two clusters corresponding into two classes that are labelled variables in datasets. e RKM algorithm has clustered the ambiguous objects into upper approximation and lower approximation. ose objects that belong to upper approximation, which belongs to one or more cluster numbers, are excluded

Read more

Summary

Research Article

Soft Clustering for Enhancing the Diagnosis of Chronic Diseases over Machine Learning Algorithms. E objective of the present research work is presented to improve the surveillance detection system for chronic diseases, which is used for the protection of people’s lives. For this purpose, the proposed system has been developed to enhance the detection of chronic disease by using machine learning algorithms. E novelty of the current research work lies in the assumption that demonstrates the noncrisp Rough K-means (RKM) clustering for figuring out the ambiguity in chronic disease dataset to improve the performance of the system. E chronic disease data are obtained from the machine learning repository and Kaggle to test and evaluate the proposed model. The proposed system has enhanced the performance of machine learning algorithms

Introduction
Machine learning algorithms
Feature name
TP FP
Cluster number Lower approximation Upper approximation
KNN with RKM
Kidney Diabetes Kidney Breast cancer Kidney Diabetes Breast cancer
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call