Performa Comparison of the K-Means Method for Classification in Diabetes Patients Using Two Normalization Methods

Dwianti Westari

doi:10.47191/ijmra/v4-i1-03

Abstract

The diabetes classification system is very useful in the health sector. This paper discusses the classification system for diabetes using the K-Means algorithm. The Pima Indian Diabetes (PID) dataset is used to train and evaluate this algorithm. The unbalanced value range in the attributes affects the quality of the classification result, so it is necessary to preprocess the data which is expected to improve the accuracy of the PID dataset classification result. Two types of preprocessing methods are used that are min-max normalization and z-score normalization. These two normalization methods are used and the classification accuracies are compared. Before the data classification process is carried out, the data is divided into training data and test data. The result of the classification test using the K-Means algorithm has shown that the best accuracy lies in the PID dataset which has been normalized using the min-max normalization method, which 79% compared to z-score normalization.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Performa Comparison of the K-Means Method for Classification in Diabetes Patients Using Two Normalization Methods

Abstract

Talk to us

Similar Papers

More From: INTERNATIONAL JOURNAL OF MULTIDISCIPLINARY RESEARCH AND ANALYSIS

Lead the way for us

Journal: INTERNATIONAL JOURNAL OF MULTIDISCIPLINARY RESEARCH AND ANALYSIS	Publication Date: Jan 6, 2021
Citations: 2

Similar Papers

Oversample‐select‐tune: A machine learning pipeline for improving diabetes identification
Sujit Kumar Das ... Arnab Kumar Mishra
Concurrency and Computation: Practice and Experience | VOL. 34
Sujit Kumar Das, et. al.Sujit Kumar Das ... Arnab Kumar Mishra
30 Nov 2021
Concurrency and Computation: Practice and Experience | VOL. 34

A Computational Intelligence Technique for Effective and Early Diabetes Detection using Rough Set Theory
Allam Apparao ... P V Nageswara Rao
International Journal of Computer Applications | VOL. 95
Allam Apparao, et. al.Allam Apparao ... P V Nageswara Rao
18 Jun 2014
International Journal of Computer Applications | VOL. 95

Kombinasi Metode Correlated Naive Bayes dan Metode Seleksi Fitur Wrapper untuk Klasifikasi Data Kesehatan
Hairani Hairani ... Muhammad Innuddin
Jurnal Teknik Elektro | VOL. 11
Hairani Hairani, et. al.Hairani Hairani ... Muhammad Innuddin
27 Apr 2020
Jurnal Teknik Elektro | VOL. 11

Application of Adaptive Neuro-Fuzzy Inference System for diabetes classification and prediction
Oana Geman ... Roxana Toderean
-
Oana Geman, et. al.Oana Geman ... Roxana Toderean
01 Jun 2017
01 Jun 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Performa Comparison of the K-Means Method for Classification in Diabetes Patients Using Two Normalization Methods

Abstract

Talk to us

Similar Papers

More From: INTERNATIONAL JOURNAL OF MULTIDISCIPLINARY RESEARCH AND ANALYSIS