Prediction of Type 2 Diabetes Mellitus Using Sequential Forward Selection (SFS) with the Support Vector Machine (SVM) Algorithm

Saputra Saputra, A Syamsu Irfan Akbar, Cipta Ramadhani Cipta Ramadhani

doi:10.29303/dielektrika.v11i1.381

Abstract

Diabetes is a chronic disease that can cause long-term complications if it is not managed properly. Early prediction can help prevent this, which calls for a machine learning model that predicts diabetes with high accuracy. This study examines two effects on model performance: reducing the number of features, and cleaning the data. Using the Pima Indian Dataset, two models were built with different preprocessing pipelines: the first without data cleaning, the second with data cleaning. After the remaining preprocessing stages, Sequential Forward Selection was used to find the number of features that gives the best performance, and each model was trained with the Support Vector Machine algorithm. After training, the two models were tested and their performance was compared. The results show that reducing the number of features improved model performance, and that, of the two models, the one built with the data cleaning stage performed better.
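The feature search described in the abstract can be illustrated with a minimal sketch of Sequential Forward Selection. This is not the authors' implementation: the function name `sequential_forward_selection`, the feature names, and the `toy_score` function with its made-up values are all assumptions for illustration. In the study's setting, the score would be a performance measure of an SVM trained on the candidate feature subset.

```python
from typing import Callable, FrozenSet, List, Sequence, Tuple


def sequential_forward_selection(
    features: Sequence[str],
    score: Callable[[FrozenSet[str]], float],
) -> List[Tuple[Tuple[str, ...], float]]:
    """Greedy SFS: start with no features and, each round, add the one
    remaining feature whose inclusion gives the highest score.

    Returns the selected subset and its score after every round, so the
    caller can pick the subset size that scored best.
    """
    selected: List[str] = []
    remaining = list(features)
    history: List[Tuple[Tuple[str, ...], float]] = []
    while remaining:
        # Evaluate each candidate subset formed by adding one remaining feature.
        best_feature = max(
            remaining, key=lambda f: score(frozenset(selected + [f]))
        )
        selected.append(best_feature)
        remaining.remove(best_feature)
        history.append((tuple(selected), score(frozenset(selected))))
    return history


# Hypothetical stand-in for "performance of an SVM trained on this subset".
# These numbers are invented to exercise the loop; they are NOT study results.
TOY_GAIN = {"glucose": 0.20, "bmi": 0.05, "age": 0.03, "skin_thickness": -0.02}


def toy_score(subset: FrozenSet[str]) -> float:
    return 0.5 + sum(TOY_GAIN[f] for f in subset)


history = sequential_forward_selection(list(TOY_GAIN), toy_score)
# Choose the round (that is, the number of features) with the highest score.
best_subset, best_score = max(history, key=lambda item: item[1])
```

With the toy scores above, the search adds the uninformative feature last, and the best-scoring subset is smaller than the full feature set. This mirrors, in miniature, the abstract's point that fewer features can give better performance.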
