Comparison of Data Mining Classification Algorithms for Stroke Disease Prediction Using the SMOTE Upsampling Method

Ronald Sebastian,Christina Juliane

doi:10.30595/juita.v11i2.17348

Abstract

Stroke is a circulation disorder in the brain that can cause symptoms and signs related to the affected part of the brain and is the leading cause of death and disability in Indonesia. Everyone is at risk of experiencing a stroke, and it is important to recognize and manage risk factors. Data Mining techniques can help in the extraction and prediction of information, as well as finding hidden patterns in stroke medical data. The dataset used in this research comes from Kaggle and is imbalanced, so the SMOTE Upsampling technique is used to address this imbalance issue. The results of the study conclude that the use of SMOTE technique in the C4.5, NB, and KNN algorithms can increase precision, recall, and AUC. The C4.5 algorithm and SMOTE technique as the best performing algorithm were selected for testing new data, and the results show that the model created can predict stroke risk more accurately than the C4.5 model without SMOTE. However, it should be noted that based on the author's interview with one of the medical practitioners, the model cannot be directly used in medical practice because the observations in the medical field to determine factors related to stroke are highly complex. Thus, a new understanding revealed that predicting stroke in a practical setting is highly complex. While data mining can be used as a predictive tool in the initial stage for predictions in the general population, it is strongly recommended to undergo direct examination by doctors in a hospital to obtain more accurate and comprehensive medical evaluations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Comparison of Data Mining Classification Algorithms for Stroke Disease Prediction Using the SMOTE Upsampling Method

Abstract

Talk to us

Similar Papers

More From: JUITA : Jurnal Informatika

Lead the way for us

Journal: JUITA : Jurnal Informatika	Publication Date: Nov 17, 2023
License type: CC BY 4.0

Similar Papers

Potensi Keberhasilan Bakal Calon Legislatif Menggunakan Algoritma K-Nearest Neighbors
Cyntia Rivatunisa
Jurnal Sistim Informasi dan Teknologi | VOL. -
Cyntia RivatunisaCyntia Rivatunisa
08 Mar 2022
Jurnal Sistim Informasi dan Teknologi | VOL. -

Heart Diseases Prediction with Data Mining and Neural Network Techniques
Bandarage Shehani Sanketha Rathnayakc ... Gamage Upeksha Ganegoda
-
Bandarage Shehani Sanketha Rathnayakc, et. al.Bandarage Shehani Sanketha Rathnayakc ... Gamage Upeksha Ganegoda
01 Apr 2018
01 Apr 2018

An Intelligent and Electronic System based Classification and Prediction for Heart Disease Diagnosis
Basheer Mohammed Al-Maqaleh
International journal of Emerging Trends in Science and Technology | VOL. -
Basheer Mohammed Al-MaqalehBasheer Mohammed Al-Maqaleh
18 May 2016
International journal of Emerging Trends in Science and Technology | VOL. -

Cardiovascular Disease in Children with Chronic Kidney Disease
Mark M Mitsnefes
Advances in Chronic Kidney Disease | VOL. 12
Mark M MitsnefesMark M Mitsnefes
01 Oct 2005
Advances in Chronic Kidney Disease | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comparison of Data Mining Classification Algorithms for Stroke Disease Prediction Using the SMOTE Upsampling Method

Abstract

Talk to us

Similar Papers

More From: JUITA : Jurnal Informatika