Abstract
Background: Google Trends (GT) is an emerging source of data that can be used to predict, detect, and track infectious disease outbreaks. GT cumulative search volume data has been shown to correlate with cumulative case counts and to produce basic and observed reproduction number estimates analogous to those derived from more traditional epidemiological data sources. An outbreak of Hepatitis-E (Hep-E) occurred in Namibia in the fall and winter of 2017–2018. We used GT data to estimate transmission dynamics of the outbreak and compared these results with those estimated via data from HealthMap, a relatively new digital data source, and with surveillance reports from the government of Namibia published in the World Health Organization Bulletin, which is a traditional data source. Objective: Aim 1: To determine the correlation between GT relative search volume data (RSV) and cumulative case counts from the HealthMap (HM) and World Health Organization (WHO) data sources. Aim 2: To estimate and compare transmission dynamics including basic reproduction numbers (R0), observed reproduction numbers (Robs), and final outbreak size (Imax) for each of the three sources of data. Methods: GT relative search volume data regarding the term “hepatitis” in Namibia was acquired from October 13, 2017–March 2, 2018. Cumulative reported case counts were obtained from the HealthMap and WHO data sources. The Incidence Decay and Exponential Adjustment (IDEA) model was used to calculate R0, Robs, and final outbreak size for the three data sources. Results: The correlation coefficient between GT cumulative relative search volume and both HM and WHO cumulative case counts measured R = 0.93. The mean R0 and Robs estimates for the hepatitis-E outbreak in Namibia were similar between the GT, HM, and WHO data sources and are similar to previously published Hep-E R0 estimates from Uganda. Final outbreak size was similar between HM and WHO data sources; however, estimates using GT-derived data sources were smaller. Conclusions: GT cumulative search volume correlated with cumulative case counts from the HM and WHO data sources. Mean R0 and Robs values were similar among the data sources considered. GT-derived final outbreak size was smaller than both HM and WHO estimates due to diminishing search volume later in the epidemic possibly due to search fatigue; nevertheless, this data source was useful in describing the transmission dynamics of the outbreak including correlation with case counts and reproduction numbers.
Highlights
28.1.1 Google Trends and HealthMap DataGoogle Trends (GT) allows users to obtain search volume data on specified search terms from defined locations and specified time frames (Nuti et al 2014; Google Trends 2018)
The Namibian Government and the WHO sponsored a “media day” to alert the general public and the medical community about the hepatitis-E outbreak on December 20, 2018 and additional public events and interventions occurred during late December 2017 through January 2018 (World Health Organization 2018a)
There is growing evidence in the literature that non-traditional digital surveillance data can accurately estimate transmission dynamics, including case counts and basic and observed reproduction numbers, during an acute infectious outbreak (Ocampo et al 2013; Yang et al 2017; Alicino et al 2015; Yang et al 2017; Majumder et al 2016) Our analysis supports the hypothesis that non-traditional data sources such as cumulative GT relative search volume data (RSV) data and HM data correlate well with traditional epidemiological surveillance data sources such as WHO cumulative case counts
Summary
28.1.1 Google Trends and HealthMap DataGoogle Trends (GT) allows users to obtain search volume data on specified search terms from defined locations and specified time frames (Nuti et al 2014; Google Trends 2018). A growing number of epidemiologic studies show correlation between GT cumulative search volume and cumulative case counts in acute infectious outbreaks. GT cumulative search volume data has been shown to correlate with cumulative case counts and to produce basic and observed reproduction number estimates analogous to those derived from more traditional epidemiological data sources. We used GT data to estimate transmission dynamics of the outbreak and compared these results with those estimated via data from HealthMap, a relatively new digital data source, and with surveillance reports from the government of Namibia published in the World Health Organization Bulletin, which is a traditional data source. Objective: Aim 1: To determine the correlation between GT relative search volume data (RSV) and cumulative case counts from the HealthMap (HM) and World Health Organization (WHO) data sources. Cumulative reported case counts were obtained from the Michael Morley and Maia Majumder contributed and are co-first authors
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have