Abstract

A large volume of datasets is available in various fields that are stored to be somewhere which is called big data. Big Data healthcare has clinical data set of every patient records in huge amount and they are maintained by Electronic Health Records (EHR). More than 80 % of clinical data is the unstructured format and reposit in hundreds of forms. The challenges and demand for data storage, analysis is to handling large datasets in terms of efficiency and scalability. Hadoop Map reduces framework uses big data to store and operate any kinds of data speedily. It is not solely meant for storage system however conjointly a platform for information storage moreover as processing. It is scalable and fault-tolerant to the systems. Also, the prediction of the data sets is handled by machine learning algorithm. This work focuses on the Extreme Machine Learning algorithm (ELM) that can utilize the optimized way of finding a solution to find disease risk prediction by combining ELM with Cuckoo Search optimization-based Support Vector Machine (CS-SVM). The proposed work also considers the scalability and accuracy of big data models, thus the proposed algorithm greatly achieves the computing work and got good results in performance of both veracity and efficiency.

Highlights

  • As the technology is evolving in enormous kind the data size is extended correspondingly

  • RDBMS is used for handling bulky dataset propose work focused the big data that are stored in data Architectures like High Performance Computing System (HPCS) to store and analyse giant data sets reliably

  • For Structured data, Naïve Bayes (NB), Cuckoo Search optimization-based Support Vector Machine (CS-SVM), Decision tree (DT) Machine learning algorithm is employed to find the risk of fatal disease

Read more

Summary

INTRODUCTION

As the technology is evolving in enormous kind the data size is extended correspondingly. There is a lot of valuable information buried in unstructured data format because this data is very discrete, complex, multidimensional and noisy In Health care Electronic Health Records (EHS) which consists of patient’s disease records that uses better clinical decision making. This data in clinical healthcare provide the way to perform predictive analysis. Speaking about the utitlities of distributed systems such as Hadoop and Mapreduce is more advntage in clinical healthcare research ares because of its large storage space and computation for huge set of data.

LITERATURE SURVEY
BIG HEALTH CARE DATA FOR ANALYTICS
Risk Factor Prediction
Decision Tree
CS-SVM
ELM BASED DISEASE RISK PREDICTION CLASSIFICATION ALGORITHM
10 Nodes 20 Nodes 40 Nodes 50 Nodes nodes in cluster
RESULT
Findings
CONCLUSION
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.