Abstract

Around 60% of COVID-19 positive cases in Indonesia have occurred in Java Island. This study provides clustering adjacent regions (cities and regencies) in Java Island into some groups based on some socio-economic factors that are suspected to affect the COVID-19 infection rates (positive cases per 100,000 residents), which could be useful for decision making by government. The factors involved in this study are poverty percentage, Human Development Index (HDI), average of expenditure per month, and open unemployment rate. There are two steps in our data analysis: first, we determined the factors that affected the infection rate significantly by using lasso, and then we estimated region-specific effects of each significant factor by using generalized lasso. In the generalized lasso, two types of spatial structure were considered, namely, regions divided by province, and neighbourhood regions based on k-means clustering and Voronoi tessellation. The tuning parameter in both lasso and generalized lasso was selected by 5-folds cross-validation. Based on the first step, three variables were found to affect the infection rate significantly. Then in the second step, the three variables had spatially varying coefficients in the generalized lasso using regions divided by provinces. On the other hand, HDI provided spatially varying coefficient in the generalized lasso using region based on k-means clustering and Voronoi tessellation.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call