Abstract

Data mining is mostly utilized for a huge variety of applications in several fields like education, medical, surveillance, and industries. The clustering is an important method of data mining, in which data elements are divided into groups (clusters) to provide better quality data analysis. The Biogeography-Based Optimization (BO) is the latest metaheuristic approach, which is applied to resolve several complex optimization problems. Here, a Chaotic Biogeography-Based Optimization approach using Information Entropy (CBO-IE) is implemented to perform clustering over healthcare IoT datasets. The main objective of CBO-IE is to provide proficient and precise data point distribution in datasets by using Information Entropy concepts and to initialize the population by using chaos theory. Both Information Entropy and chaos theory are facilitated to improve the convergence speed of BO in global search area for selecting the cluster heads and cluster members more accurately. The CBO-IE is implemented to a MATLAB 2021a tool over eight healthcare IoT datasets, and the results illustrate the superior performance of CBO-IE based on F-Measure, intracluster distance, running time complexity, purity index, statistical analysis, root mean square error, accuracy, and standard deviation as compared to previous techniques of clustering like K-Means, GA, PSO, ALO, and BO approaches.

Highlights

  • Data mining is mostly utilized for a huge variety of applications in several fields like education, medical, surveillance, and industries. e clustering is an important method of data mining, in which data elements are divided into groups to provide better quality data analysis. e Biogeography-Based Optimization (BO) is the latest metaheuristic approach, which is applied to resolve several complex optimization problems

  • A Chaotic Biogeography-Based Optimization approach using Information Entropy (CBO-IE) is implemented to perform clustering over healthcare IoT datasets. e main objective of CBO-IE is to provide proficient and precise data point distribution in datasets by using Information Entropy concepts and to initialize the population by using chaos theory. Both Information Entropy and chaos theory are facilitated to improve the convergence speed of BO in global search area for selecting the cluster heads and cluster members more accurately. e CBO-IE is implemented to a MATLAB 2021a tool over eight healthcare IoT datasets, and the results illustrate the superior performance of CBO-IE based on FMeasure, intracluster distance, running time complexity, purity index, statistical analysis, root mean square error, accuracy, and standard deviation as compared to previous techniques of clustering like K-Means, Genetic Algorithm (GA), Particle Swarm Optimization (PSO), Ant Lion Optimization (ALO), and BO approaches

  • Introduction e big data [1, 2] is used and analyzed in several wireless applications by utilizing various characteristics like storage, processing, and maintenance of data. e preprocessing of a huge amount of data is performed before analysis to reduce the data redundancy with enhancing the data accuracy and efficiency [3,4,5]. e big data is processed by using various nature inspired optimization approaches like Genetic Algorithm (GA), Ant Colony Optimization (ACO), Ant Lion Optimization (ALO), and Particle Swarm Optimization (PSO) to perform optimal analysis [6]

Read more

Summary

Research Article

CBO-IE: A Data Mining Approach for Healthcare IoT Dataset Using Chaotic Biogeography-Based Optimization and Information Entropy. A Chaotic Biogeography-Based Optimization approach using Information Entropy (CBO-IE) is implemented to perform clustering over healthcare IoT datasets. (3) e CBO-IE is applied over eight healthcare IoT datasets and the outputs are evaluated on the basis of F-Measure, intracluster distance, running time complexity, purity index, statistical analysis, standard deviation, root mean square error, and accuracy as compared to previous techniques of clustering like K-Means, GA, PSO, ALO, and BO approaches. Various researchers mainly solved the issues of infinite size and perception development, but the perception flow and characteristic assessment are not taken into consideration by researchers These two issues are removed in implementing a classification method over big data and parameters like error rate and running time are obtained to provide superior efficiency of the method [34]. Where Tmaximum is the highest mutation rate term illustrated by user and Smaximum is the highest possibility of castes count

After that the mutation operator is improved by utilizing
Ea maximum
Evaluate HSI value END WHILE
Results and Analysis
No Yes Result the optimal solution
No of clusters
Standard Deviation
Blood transfusion
Purity Index Thoracic Surgery
GA PSO ALO BO
Diabetic retinopathy
Rejected Rejected Rejected Rejected Rejected
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call