Abstract

The novel coronavirus (COVID-19) is an emergent disease that initially had no historical data to guide scientists on predicting/ forecasting its global or national impact over time. The ability to predict the progress of this pandemic has been crucial for decision making aimed at fighting this pandemic and controlling its spread. In this work we considered four different statistical/time series models that are readily available from the ‘forecast’ package in R. We performed novel applications with these models, forecasting the number of infected cases (confirmed cases and similarly the number of deaths and recovery) along with the corresponding 90% prediction interval to estimate uncertainty around pointwise forecasts. Since the future may not repeat the past for this pandemic, no prediction model is certain. However, any prediction tool with acceptable prediction performance (or prediction error) could still be very useful for public-health planning to handle spread of the pandemic, and could policy decision-making and facilitate transition to normality. These four models were applied to publicly available data of the COVID-19 pandemic for both the USA and Italy. We observed that all models reasonably predicted the future numbers of confirmed cases, deaths, and recoveries of COVID-19. However, for the majority of the analyses, the time series model with autoregressive integrated moving average (ARIMA) and cubic smoothing spline models both had smaller prediction errors and narrower prediction intervals, compared to the Holt and Trigonometric Exponential smoothing state space model with Box-Cox transformation (TBATS) models. Therefore, the former two models were preferable to the latter models. Given similarities in performance of the models in the USA and Italy, the corresponding prediction tools can be applied to other countries grappling with the COVID-19 pandemic, and to any pandemics that can occur in future.

Highlights

  • Cases of severe respiratory illness began to be reported across the city of Wuhan in China in December 2019

  • How many people will be infected? How does the situation change day by day? Can we predict/forecast the future numbers of people infected with COVID-19 by using daily updated data on the pandemic’s trajectory? These questions could be answered by forecasting the possible futures of this pandemic through statistical/time series models

  • We provide statistical forecasts for the confirmed cases of COVID-19 using four different, highly cited time series models, which we subsequently introduce, and compare their performance to analyze the trajectory of cases

Read more

Summary

Introduction

Cases of severe respiratory illness began to be reported across the city of Wuhan in China in December 2019. Can we predict/forecast the future numbers of people infected with COVID-19 by using daily updated data on the pandemic’s trajectory? As the COVID-19 pandemic evolves and additional data are amassed [4], new insights emerge on the reasonableness of the prediction models developed and tested; predicting its future requires transparent reporting and multiple model assessments [3]. Despite these ongoing challenges, forecasting is still critical to better understand the current situation and progression of COVID-19, to prepare for the future of this ongoing pandemic. We provide statistical forecasts for the confirmed cases of COVID-19 using four different, highly cited time series models, which we subsequently introduce, and compare their performance to analyze the trajectory of cases

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call