Data mining is searching for interesting patterns or information by selecting data using specific techniques or methods. One method that can be used in data mining is K-Medoids. K-Medoids is a method used to group objects into a cluster. This research aimed to obtain the optimal number of clusters using the K-Medoids method based on Davies-Bouldin Index (DBI) validity on education indicators data by province in Indonesia in 2021. The results showed that the optimal number of clusters using the K-Medoids method based on DBI validity is 5 clusters. Cluster 1 consists of 1 province with a higher average dropout rate, average length of schooling, and well-owned classrooms compared to other clusters. Cluster 2 consists of 15 provinces with an average proportion of school libraries lower than Clusters 3 and 4 and higher than Clusters 1 and 5. Cluster 3 consists of 9 provinces with an average proportion of school libraries, proportions of school laboratories, net enrollment rates, and higher school enrollment rates than other clusters. Cluster 4 consists of 8 provinces with a higher average enrollment rate than the other clusters. Cluster 5 consists of 1 province with a higher average repetition rate and student-per-teacher ratio than other clusters.
Read full abstract