Abstract

To make healthcare available and easily accessible, the Internet of Things (IoT), which paved the way to the construction of smart cities, marked the birth of many smart applications in numerous areas, including healthcare. As a result, smart healthcare applications have been and are being developed to provide, using mobile and electronic technology, higher diagnosis quality of the diseases, better treatment of the patients, and improved quality of lives. Since smart healthcare applications that are mainly concerned with the prediction of healthcare data (like diseases for example) rely on predictive healthcare data analytics, it is imperative for such predictive healthcare data analytics to be as accurate as possible. In this paper, we will exploit supervised machine learning methods in classification and regression to improve the performance of the traditional Random Forest on healthcare datasets, both in terms of accuracy and classification/regression speed, in order to produce an effective and efficient smart healthcare application, which we have termed eGAP. eGAP uses the evolutionary game theoretic approach replicator dynamics to evolve a Random Forest ensemble. Trees of high resemblance in an initial Random Forest are clustered, and then clusters grow and shrink by adding and removing trees using replicator dynamics, according to the predictive accuracy of each subforest represented by a cluster of trees. All clusters have an initial number of trees that is equal to the number of trees in the smallest cluster. Cluster growth is performed using trees that are not initially sampled. The speed and accuracy of the proposed method have been demonstrated by an experimental study on 10 classification and 10 regression medical datasets.

Highlights

  • The advent of the Internet of Things (IoT) led to the construction of smart cities [1], which were mainly developed to provide high degree of information technology, and a comprehensive application of information resources

  • CLUB-DRF is able to find a subset of trees from a typically constructed Random Forest (RF) that, in most cases, provides an even higher classification accuracy with only 5% of the trees used in RF

  • In this paper, we propose using a game theoretic approach, namely, replicator dynamics to optimise the choice of a different number of trees per cluster

Read more

Summary

Introduction

The advent of the Internet of Things (IoT) led to the construction of smart cities [1], which were mainly developed to provide high degree of information technology, and a comprehensive application of information resources. Many smart applications have come into existence in many areas including but not limited to smart energy [2], smart education [3], smart transportation [4], and smart healthcare [5]. For the latter area, which is the relevant one to the research conducted in this paper, many smart healthcare applications were developed as outlined in [6]. CLUB-DRF uses clustering of trees based on similarity of prediction. In its worst case scenario, it can reach 2t, where t is the number of trees in the original RF

Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call