Abstract

BackgroundInfectious diarrhea can lead to a considerable global disease burden. Thus, the accurate prediction of an infectious diarrhea epidemic is crucial for public health authorities. This study was aimed at developing an optimal random forest (RF) model, considering meteorological factors used to predict an incidence of infectious diarrhea in Jiangsu Province, China.MethodsAn RF model was developed and compared with classical autoregressive integrated moving average (ARIMA)/X models. Morbidity and meteorological data from 2012 to 2016 were used to construct the models and the data from 2017 were used for testing.ResultsThe RF model considered atmospheric pressure, precipitation, relative humidity, and their lagged terms, as well as 1–4 week lag morbidity and time variable as the predictors. Meanwhile, a univariate model ARIMA (1,0,1)(1,0,0)52 (AIC = − 575.92, BIC = − 558.14) and a multivariable model ARIMAX (1,0,1)(1,0,0)52 with 0–1 week lag precipitation (AIC = − 578.58, BIC = − 578.13) were developed as benchmarks. The RF model outperformed the ARIMA/X models with a mean absolute percentage error (MAPE) of approximately 20%. The performance of the ARIMAX model was comparable to that of the ARIMA model with a MAPE reaching approximately 30%.ConclusionsThe RF model fitted the dynamic nature of an infectious diarrhea epidemic well and delivered an ideal prediction accuracy. It comprehensively combined the synchronous and lagged effects of meteorological factors; it also integrated the autocorrelation and seasonality of the morbidity. The RF model can be used to predict the epidemic level and has a high potential for practical implementation.

Highlights

  • Infectious diarrhea can lead to a considerable global disease burden

  • The autoregressive integrated moving average (ARIMA) model has been widely used as classical method for diarrhea incidence prediction, it has some limitations at the same time [4,5,6,7]

  • Meteorological factors have been reported to be non-linearly associated with the infectious diarrhea epidemic [9, 10]

Read more

Summary

Introduction

Infectious diarrhea can lead to a considerable global disease burden. the accurate prediction of an infectious diarrhea epidemic is crucial for public health authorities. This study was aimed at developing an optimal random forest (RF) model, considering meteorological factors used to predict an incidence of infectious diarrhea in Jiangsu Province, China. Infectious diarrhea is one of the major causes of morbidity and mortality in infants and younger populations. It is a major global public health issue, in developing countries [1]. Several studies have reported that meteorological factors are associated with diarrhea and can be used to predict its incidence [8, 9]. Meteorological factors have been reported to be non-linearly associated with the infectious diarrhea epidemic [9, 10]

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call