Abstract. Predicting infections resulting from epidemiological contagions involves a complex therapeutic endeavor that necessitates the consideration of various factors within the available data. COVID-19, a globally recognized highly dangerous disease, presents an opportunity to effectively understand its patterns and spatial determinants to combat similar ailments. This study seeks to employ a deep neural network and several machine learning techniques to identify the crucial factors influencing COVID-19 infections and to forecast the spatial-temporal spread of the disease. Previous research has focused on predicting outcomes using a limited set of 8 patient variables. To contribute to this field, the current investigation analyzes a dataset comprising 47,029 records of infected individuals who sought medical attention, with 18,433 of them testing positive for the coronavirus and requiring hospitalization. The study explores the prediction of illness and hospitalization needs based on geographical locations by utilizing machine learning algorithms such as Random Forest, Naive Bayes, Decision Tree, and K-Nearest Neighbor. The machine learning process commences with input data (comprising the 47,029 records), enabling the machine to identify patterns within the dataset and subsequently make informed decisions based on these patterns and insights gained. The primary objective is to allow the machine to autonomously learn and refine its predictions without human intervention. Applying these algorithms to COVID-19 data for spatial-temporal prediction revealed that the Random Forest algorithm achieved the highest accuracy of 0.84, while the Simple Bayes algorithm exhibited the lowest accuracy at 0.50. Additionally, the study compared random forest, decision tree, simple bayes, and K-nearest neighbor algorithms to predict the severity of road accidents.
Read full abstract