Abstract
The real-time monitoring and tracking systems play a critical role in the healthcare field. Wearable medical devices with sensors, mobile applications, and health cloud have continuously generated an enormous amount of data, often called streaming big data. Due to the higher speed of the streaming data, it is difficult to ingest, process, and analyze such huge data in real-time to make real-time actions in case of emergencies. Using traditional methods that are inadequate and time-consuming. Therefore, there is a significant need for real-time big data stream processing to guarantee an effective and scalable solution. So, we proposed a new system for online prediction to predict health status using Spark streaming framework. The proposed system focuses on applying streaming machine learning models (i.e. streaming linear regression with SGD) on streaming health data events ingested to spark streaming through Kafka topics. The experimental results are done on the historical medical datasets (i.e. diabetes dataset, heart disease dataset, and breast cancer dataset) and generated dataset which is simulated to wearable medical sensors. The historical datasets have shown that the accuracy improvement ratio obtained using the diabetes disease dataset is the highest one with respect to the other two datasets with an accuracy of 81%. For generated datasets, the online prediction system has achieved accuracy with 98% at 5 seconds window size. Beyond this, the experimental results have proofed that the online prediction system can online learn and update the model according to the new data arrival and window size.
Highlights
Nowadays, the era has been described as the era of big data where all data is digitalized and becomes of great importance in such beautiful fields
Real-time healthcare analytical involves real-time streaming data processing, streaming machine learning algorithms, and analyzing real time to build an online electronic system to deal with the stream of healthcare data. we developed an online prediction healthcare system for streaming data coming from IoT devices to predict health status for the patient in real time
The historical data is ingested into the proposed system using three medical datasets chosen from UCI machine learning repository; Pima Indians diabetes [21], Cleveland heart disease [22] and breast cancer Coimbra [23]
Summary
The era has been described as the era of big data where all data is digitalized and becomes of great importance in such beautiful fields. In the healthcare field, a big data collected in real-time by a remote sensing device, Wearable Medical Devices, and other data gathering tools, which produce new challenges that focus on data size and the fast growth rate of these data [2]. The digital record of the patient’s medical history is the primary health care data as it is obtained from various types of health care data sources in both clinical and non-clinical settings. These digital data are not available for research
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: International Journal of Advanced Computer Science and Applications
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.