Abstract
Dengue and influenza-like illness (ILI) are two of the leading causes of viral infection in the world and it is estimated that more than half the world’s population is at risk for developing these infections. It is therefore important to develop accurate methods for forecasting dengue and ILI incidences. Since data from multiple sources (such as dengue and ILI case counts, electronic health records and frequency of multiple internet search terms from Google Trends) can improve forecasts, standard time series analysis methods are inadequate to estimate all the parameter values from the limited amount of data available if we use multiple sources. In this paper, we use a computationally efficient implementation of the known variable selection method that we call the Autoregressive Likelihood Ratio (ARLR) method. This method combines sparse representation of time series data, electronic health records data (for ILI) and Google Trends data to forecast dengue and ILI incidences. This sparse representation method uses an algorithm that maximizes an appropriate likelihood ratio at every step. Using numerical experiments, we demonstrate that our method recovers the underlying sparse model much more accurately than the lasso method. We apply our method to dengue case count data from five countries/states: Brazil, Mexico, Singapore, Taiwan, and Thailand and to ILI case count data from the United States. Numerical experiments show that our method outperforms existing time series forecasting methods in forecasting the dengue and ILI case counts. In particular, our method gives a 18 percent forecast error reduction over a leading method that also uses data from multiple sources. It also performs better than other methods in predicting the peak value of the case count and the peak time.
Highlights
Dengue is a mosquito-borne viral disease that affects a large fraction of the world [1]
We focus on a time series modelling approach to forecast dengue and influenza-like illness (ILI) incidences
We have presented Autoregressive Likelihood Ratio (ARLR) method for estimating a sparse autoregressive model from observations using a likelihood ratio approach
Summary
Dengue is a mosquito-borne viral disease that affects a large fraction of the world [1] It is estimated [2] that almost half the world’s population spread out over 128 countries is at risk of dengue infection while 400 million people could be infected by dengue [3] every year. Dengue cases have been reported in Europe, China, and the USA [1] expanding the regions that could witness dengue outbreaks even further. Influenza is another viral disease that affects a significant fraction of the world population. Deaths occur mainly among people aged 65 years or above in the developed world [5] and children below 5 years of age in developing countries [6]
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.