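Optimasi Cluster Pada K-Means Clustering Dengan Teknik Reduksi Dimensi Dataset Menggunakan Gini Index (Cluster Optimization in K-Means Clustering Using Dataset Dimensionality Reduction with the Gini Index)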

Herman Mawengkang, Muhammad Imam Zarkasyi, Opim Salim Sitompul

doi:10.47065/bits.v4i3.2458

Open Access

https://doi.org/10.47065/bits.v4i3.2458

Abstract

In K-Means clustering, the number of attributes in a dataset can affect the number of iterations required to group the data. One way to address this is to reduce the dimensionality of the dataset. In this study, the authors apply the Gini Index to remove attributes that have no effect on the dataset before clustering with K-Means. The test dataset is Absenteeism at work, obtained from the UCI Machine Learning Repository, with 20 attributes, 740 records, and 4 attribute classes. In the tests, conventional K-Means (without attribute reduction) required 9 iterations, while K-Means with Gini Index attribute reduction required 6 iterations. Clustering quality was evaluated using the Sum of Squared Errors (SSE): the SSE was 1391.613 for conventional K-Means and 440.912 for K-Means with Gini Index attribute reduction. These results indicate that reducing the dimensionality of the dataset with the Gini Index lowers the clustering error and the number of iterations in K-Means clustering.
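The pipeline described in the abstract (score attributes with the Gini Index, drop the uninformative ones, then run K-Means and report iterations and SSE) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy data, the `min_gain` threshold, the function names, and the treatment of every attribute as categorical are assumptions made for illustration. Continuous attributes would likely need to be binned before scoring.

```python
import random
from collections import Counter, defaultdict


def gini_impurity(labels):
    """Gini impurity of a list of class labels: 1 - sum(p_k ** 2)."""
    n = len(labels)
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def gini_index(values, labels):
    """Weighted Gini impurity of the labels after splitting on one categorical attribute."""
    groups = defaultdict(list)
    for value, label in zip(values, labels):
        groups[value].append(label)
    n = len(labels)
    return sum(len(g) / n * gini_impurity(g) for g in groups.values())


def select_attributes(rows, labels, min_gain=0.01):
    """Keep the column indices whose split lowers the impurity by at least min_gain.

    min_gain is a hypothetical threshold, not a value taken from the paper.
    """
    base = gini_impurity(labels)
    keep = []
    for j in range(len(rows[0])):
        column = [row[j] for row in rows]
        if base - gini_index(column, labels) >= min_gain:
            keep.append(j)
    return keep


def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k, max_iter=100, seed=0):
    """Plain Lloyd's K-Means. Returns (centroids, assignments, iterations, sse)."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    assign = None
    for iteration in range(1, max_iter + 1):
        new_assign = [
            min(range(k), key=lambda c: sq_dist(p, centroids[c])) for p in points
        ]
        if new_assign == assign:  # no point changed cluster: converged
            break
        assign = new_assign
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    sse = sum(sq_dist(p, centroids[a]) for p, a in zip(points, assign))
    return centroids, assign, iteration, sse


# Toy data (made up): the third attribute is constant, so it carries no information.
rows = [[0, 1, 5], [0, 1, 5], [0, 2, 5], [9, 8, 5], [9, 9, 5], [9, 8, 5]]
labels = ["a", "a", "a", "b", "b", "b"]

keep = select_attributes(rows, labels)
reduced = [[row[j] for j in keep] for row in rows]
centroids, assign, iterations, sse = kmeans(reduced, k=2)
print("kept attribute indices:", keep)
print("iterations:", iterations, "SSE:", round(sse, 3))
```

Running the same `kmeans` function on `rows` and on `reduced` gives the kind of before-and-after comparison the abstract reports. Note that in this sketch the iteration count includes the final pass that confirms no assignment changed, which may differ from how the paper counts iterations.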

Journal: Building of Informatics, Technology and Science (BITS)
Publication Date: Dec 26, 2022
License type: CC BY 4.0

Similar Papers

Seleksi Fitur Menggunakan Eigen Vector Untuk Peningkatan Kinerja K-Means Clustering Dalam Pengelompokan Data
Muhammad Zarlis ... Syahril Efendi
Building of Informatics, Technology and Science (BITS) | VOL. 4
29 Sep 2022

Analysis of the effect early cluster centre points on the combination of k-means algorithms and sum of squared error on k centroid
D Selvida ... M Zarlis
IOP Conference Series: Materials Science and Engineering | VOL. 725
01 Jan 2020

Designing a parallel Feel-the-Way clustering algorithm on HPC systems
Weijian Zheng ... Dali Wang
The International Journal of High Performance Computing Applications | VOL. 35
28 Nov 2020

Decentralized K-means clustering with MANET swarms
13 Apr 2014
