Abstract

The incidence of stroke cases has witnessed a rapid global rise, affecting not only the elderly but also individuals across all age groups. Accurate prediction of stroke occurrence demands the utilization of extensive data pre-processing techniques. Moreover, the automation of early stroke forecasting is crucial to prevent its onset at the initial stage. In this study, stroke prediction models are evaluated to estimate the likelihood of stroke based on various symptoms such as age, gender, pre-existing medical conditions, and social variables. The machine learning techniques employed include Linear Support Vector Classifier, Extreme Gradient Boosting Classifier, Multilayer Perceptron, Adaptive Boosting Classifier, Bootstrap Aggregating Classifier, and Light Gradient-Boosting Machine. The purpose of this paper is to optimize the hyperparameters of machine learning approaches in developing stroke prediction models. The goal was achieved through a comprehensive comparison of three different sampling techniques for handling imbalanced datasets and evaluating their performance by using various metrics. The most effective model is identified, which is the Adaptive Boosting Classifier utilizing the Tomek Links, with a cross-dataset accuracy of 99% which demonstrated a reliable performance and generalization as evidenced by high cross-validation scores and accuracy on an independent dataset. The next stage of this endeavor entails looking into multiple ways to forecast the development of new dangerous diseases such as breast cancer and skin disorders. In the long run, the aim of subsequent work is to build a powerful toolset that is obtainable to all medical practitioners, allowing for the pre-emptive diagnosis of all potentially hazardous illnesses.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.