Abstract

The spread of the coronavirus in Indonesia is quite fast. The spread of Covid 19 is almost evenly distributed in all provinces in Indonesia. Some areas even have a fairly high mortality rate. Therefore, it is necessary to group regions to find out which areas have the highest to lowest Covid cases so that the appropriate response process can be carried out. In addition, data visualization is also needed that provides information on COVID-19 data for each province. In this study, the data were grouped using the K-Means Clustering method. The dataset used is the Indonesian Covid-19 dataset from Kaggle. The criteria for each province's covid cluster are the number of cases and deaths. The Clustering process uses the Python programming language. From the results of this study, it can be seen that there are 3 groups of covid. The first group consists of 30 provinces with several cases below 200,000 and a number of deaths below 6000. The second group contains two provinces that have the highest number of cases, namely above 600,000, but the number of deaths is less than group 3, which is 15000. In group 3 there are 2 provinces where the number of cases is below 500,000 but the death rate is above 30,000.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call