FluBreaks: Early Epidemic Detection from Google Flu Trends

Fahad Pervaiz, Mansoor Pervaiz, Umar Saif, Nabeel Abdur Rehman

doi:10.2196/jmir.2102


Open Access

https://doi.org/10.2196/jmir.2102


Abstract

Background: The Google Flu Trends service was launched in 2008 to track changes in the volume of online search queries related to flu-like symptoms. Over the last few years, the trend data produced by this service has shown a consistent relationship with the actual number of flu reports collected by the US Centers for Disease Control and Prevention (CDC), often identifying increases in flu cases weeks in advance of CDC records. However, contrary to popular belief, Google Flu Trends is not an early epidemic detection system. Instead, it is designed as a baseline indicator of the trend, or changes, in the number of disease cases.

Objective: To evaluate whether these trends can be used as a basis for an early warning system for epidemics.

Methods: We present the first detailed algorithmic analysis of how Google Flu Trends can be used as a basis for building a fully automated system for early warning of epidemics in advance of methods used by the CDC. Based on our work, we present a novel early epidemic detection system, called FluBreaks (dritte.org/flubreaks), based on Google Flu Trends data. We compared the accuracy and practicality of three types of algorithms: normal distribution algorithms, Poisson distribution algorithms, and negative binomial distribution algorithms. We explored the relative merits of these methods, and related our findings to changes in Internet penetration and population size for the regions in Google Flu Trends providing data.

Results: Across our performance metrics of percentage true-positives (RTP), percentage false-positives (RFP), percentage overlap (OT), and percentage early alarms (EA), Poisson- and negative binomial-based algorithms performed better in all except RFP. Poisson-based algorithms had average values of 99%, 28%, 71%, and 76% for RTP, RFP, OT, and EA, respectively, whereas negative binomial-based algorithms had average values of 97.8%, 17.8%, 60%, and 55% for RTP, RFP, OT, and EA, respectively. Moreover, the EA was also affected by the region’s population size. Regions with larger populations (regions 4 and 6) had higher values of EA than region 10 (which had the smallest population) for negative binomial- and Poisson-based algorithms. The difference was 12.5% and 13.5% on average in negative binomial- and Poisson-based algorithms, respectively.

Conclusions: We present the first detailed comparative analysis of popular early epidemic detection algorithms on Google Flu Trends data. We note that realizing this opportunity requires moving beyond the cumulative sum and historical limits method-based normal distribution approaches, traditionally employed by the CDC, to negative binomial- and Poisson-based algorithms to deal with potentially noisy search query data from regions with varying population and Internet penetrations. Based on our work, we have developed FluBreaks, an early warning system for flu epidemics using Google Flu Trends.
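
As an illustration of the general idea behind the Poisson-based family of algorithms named in the abstract, the following is a minimal sketch, not the FluBreaks implementation. It assumes weekly integer counts, a short sliding baseline whose mean serves as the Poisson rate, and a fixed significance threshold. The function names (poisson_upper_tail, poisson_alarms) and the parameters (baseline_weeks, alpha) are hypothetical and do not come from the paper.

```python
import math
from statistics import mean


def poisson_upper_tail(k, lam):
    """Return P(X >= k) for X ~ Poisson(lam).

    The lower tail P(X <= k - 1) is summed in log space (via lgamma)
    so that large rates do not underflow, then subtracted from 1.
    """
    if k <= 0:
        return 1.0
    if lam <= 0:
        return 0.0
    cdf = sum(
        math.exp(i * math.log(lam) - lam - math.lgamma(i + 1))
        for i in range(k)
    )
    return max(0.0, 1.0 - cdf)


def poisson_alarms(counts, baseline_weeks=4, alpha=0.01):
    """Flag weeks whose count is improbably high under a Poisson model.

    Assumption (illustrative only): the expected rate for week t is the
    mean of the preceding `baseline_weeks` counts. Returns a list of
    (week_index, is_alarm) pairs for every week that has a full baseline.
    """
    results = []
    for t in range(baseline_weeks, len(counts)):
        lam = mean(counts[t - baseline_weeks:t])
        p_value = poisson_upper_tail(counts[t], lam)
        results.append((t, p_value < alpha))
    return results


# Hypothetical weekly counts: five quiet weeks followed by a sharp rise.
weekly_counts = [10, 12, 9, 11, 10, 30]
alarms = poisson_alarms(weekly_counts)
```

With these made-up counts, the final week is flagged because a count of 30 is very unlikely when the preceding four weeks average 10.5. A negative binomial variant would follow the same structure but replace the tail probability with one that allows the variance to exceed the mean, which is the usual motivation for preferring it on noisy count data.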

Journal: Journal of Medical Internet Research
Publication Date: Oct 4, 2012
Citations: 112
License type: CC BY

Similar Papers

Emergency department and ‘Google flu trends’ data as syndromic surveillance indicators for seasonal influenza
L H Thompson ... S M Mahmud
Epidemiology and Infection | VOL. 142 | 20 Jan 2014

“Google Flu Trends” and Emergency Department Triage Data Predicted the 2009 Pandemic H1N1 Waves in Manitoba
Mohammad Tufail Malik ... Laura H Thompson
Canadian Journal of Public Health | VOL. 102 | 01 Jul 2011

Reassessing Google Flu Trends data for detection of seasonal and pandemic influenza: a comparative epidemiological study at three geographic scales.
Donald R Olson ... Kevin J Konty
PLoS Computational Biology | VOL. 9 | 17 Oct 2013

Improving Google Flu Trends estimates for the United States through transformation.
Leah J Martin ... Biying Xu
PLoS ONE | VOL. 9 | 31 Dec 2015
