Abstract

Stroke are known as the second most leading cause of death. Because of this, data mining techniques are already being used to predict patients that may have stroke. Therefore, we are doing a study to try using data mining techniques using RapidMiner to find information or patterns regarding stroke from a dataset obtained from Kaggle. Three data mining techniques are used in this study, that is classification using decision trees, association rule using FP-Growth algorithm, and clustering technique using k-Means algorithm. Using RapidMiner, we are able to process the dataset using the operators provided in the application. As the result, we found out that due to an unbalanced data, the decision tree model made were only able to predict 68,75% patients as having stroke. With the association rule technique, we found out that most attributes in the dataset does not really associated with each other. With the clustering technique, we were able to group up patients and found out that most patients that have stroke are averaged in the age of 58, with 31 bmi, and 201 average glucose level.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.