Abstract

These days technology are rapidly increasing and developing in various fields, especially data storage. The information that has been stored in a database usually called a dataset. Covid-19 is a new type of respiratory disease that attacks the respiratory system with rapid transmission, followed by the increasing number of Covid-19 cases that continues to increase every day in all provinces in Indonesia. This study aims to cluster the spread of Covid-19 in every province in Indonesia by using the data that obtained from the website named kaggle with many data variables. The method used in this research is K-Means. From many variables in the data, for this study only 3 variables were taken, which are: Number of Recovery, Number of Deaths, and Number of total Cases in Covid-19 in Indonesia. These 3 variables then will be applied using the K-Means method and formed 3 provincial groups. By using the clustering method and the K-means algorithm, this research can be carried out to find the characteristics of the distribution in each province in Indonesia by looking at the best clusters.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call