Performance analysis and comparison of various machine learning algorithms for early stroke prediction

Vinay Padimi,Devarani Devi Ningombam,Venkata Sravan Telu

doi:10.4218/etrij.2022-0271

Vinay Padimi, Devarani Devi Ningombam + Show 1 more

Open Access

https://doi.org/10.4218/etrij.2022-0271

Copy DOI

Abstract

AbstractStroke is the leading cause of permanent disability in adults, and it can cause permanent brain damage. According to the World Health Organization, 795 000 Americans experience a new or recurrent stroke each year. Early detection of medical disorders, for example, strokes, can minimize the disabling effects. Thus, in this paper, we consider various risk factors that contribute to the occurrence of stoke and machine learning algorithms, for example, the decision tree, random forest, and naive Bayes algorithms, on patient characteristics survey data to achieve high prediction accuracy. We also consider the semisupervised self‐training technique to predict the risk of stroke. We then consider the near‐miss undersampling technique, which can select only instances in larger classes with the smaller class instances. Experimental results demonstrate that the proposed method obtains an accuracy of approximately 98.83% at low cost, which is significantly higher and more reliable compared with the compared techniques.

Full Text