Abstract

The k-harmonic means method is a center-based clustering method that determines each cluster from its center point using the harmonic average. K-harmonic means assigns each data point through a membership function and a weighting function, both computed from a distance measure, with the aim of increasing the importance of data points that are far from every center point. This makes k-harmonic means insensitive to the initialization of the cluster center points and significantly improves clustering quality compared with k-means. The level of similarity is determined with the Euclidean distance measure. Because the distance measure used in cluster analysis can affect the resulting clusters, the quality of the results was validated with an internal criterion, namely the silhouette coefficient. In this study, k-harmonic means is used to cluster the provinces of Indonesia based on the causes of stunting in 2018. Stunting among children under five in Indonesia has exceeded the limit set by the WHO: between 2016 and 2017 its prevalence rose from 27.5% to 29.6%. The k-harmonic means method is used so that the four main factors causing stunting in each province can be identified and the prevention and treatment of stunting can be carried out optimally. The method was also chosen because the data on the four factors show a significant rate of change across the 34 provinces, for which the harmonic mean serves as the measure of central tendency.
Four factors that cause stunting are used: the percentage of households without access to clean drinking water, the percentage of exclusive breastfeeding, the percentage of Low Birth Weight (LBW, 2,500 grams) babies born safely, and the percentage of households without proper sanitation facilities. The optimal clustering is obtained at k = 3 using the Euclidean distance, with a silhouette coefficient of 0.3040722675 ≈ 0.3. Based on the cluster analysis, the most prominent factor in cluster one is the percentage of exclusive breastfeeding. In cluster two, it is the percentage of LBW babies born safely. In cluster three, the most prominent factors are the percentage of LBW babies born safely and the percentage of households without proper sanitation facilities, which have the highest average centroid values among the clusters.

Keywords: Clustering, K-Harmonic Means, Euclidean distance, Silhouette Coefficient, Stunting
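The pipeline described in the abstract (k-harmonic means with Euclidean distance, validated by the silhouette coefficient) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation. The membership and weighting formulas follow a commonly cited formulation of k-harmonic means. The exponent p = 3.5, the function names, and the toy 2-D data are all assumptions made for this example, and the toy data is not the study's provincial data. The initial centers are picked by hand so the run is reproducible.

```python
import math
import random


def euclidean(a, b):
    """Euclidean distance between two equal-length sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def k_harmonic_means(points, k, p=3.5, iters=50, init=None, seed=0, eps=1e-8):
    """Basic k-harmonic means loop; returns (centers, hard labels).

    p=3.5 is an assumed exponent; the abstract does not state one.
    """
    if init is None:
        init = random.Random(seed).sample(list(points), k)
    centers = [list(c) for c in init]
    dim = len(points[0])
    for _ in range(iters):
        num = [[0.0] * dim for _ in range(k)]
        den = [0.0] * k
        for x in points:
            # Clamp distances so a point lying on a center cannot divide by zero.
            d = [max(euclidean(x, c), eps) for c in centers]
            a = [dj ** (-p - 2) for dj in d]
            b = [dj ** (-p) for dj in d]
            # Weighting function: large for points far from every center.
            weight = sum(a) / sum(b) ** 2
            for j in range(k):
                # Soft membership of x in cluster j, scaled by the point weight.
                mw = (a[j] / sum(a)) * weight
                den[j] += mw
                for t in range(dim):
                    num[j][t] += mw * x[t]
        centers = [[num[j][t] / den[j] for t in range(dim)] for j in range(k)]
    # The nearest center is also the center with the highest membership.
    labels = [min(range(k), key=lambda j: euclidean(x, centers[j])) for x in points]
    return centers, labels


def silhouette_coefficient(points, labels):
    """Mean silhouette width; points in singleton clusters score 0."""
    clusters = {}
    for i, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(i)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")
    total = 0.0
    for i, x in enumerate(points):
        own = clusters[labels[i]]
        if len(own) == 1:
            continue
        # a: mean distance to the other members of the point's own cluster.
        a = sum(euclidean(x, points[j]) for j in own if j != i) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(
            sum(euclidean(x, points[j]) for j in members) / len(members)
            for lab, members in clusters.items()
            if lab != labels[i]
        )
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(points)


# Hypothetical toy data: three well-separated groups of 2-D points.
toy = [(0.0, 0.1), (0.2, 0.0), (0.1, 0.3),
       (5.0, 5.1), (5.2, 4.9), (4.9, 5.3),
       (10.0, 0.2), (10.1, 0.0), (9.8, 0.1)]
centers, labels = k_harmonic_means(toy, k=3, init=[toy[0], toy[3], toy[6]])
score = silhouette_coefficient(toy, labels)
print(labels, round(score, 3))
```

To choose k the way the abstract describes, one would presumably run this for several candidate values of k and keep the one with the highest silhouette coefficient. For real data with features on different scales, standardizing the columns first is a common precaution, though the abstract does not say whether the study did so.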

