Reappraising the utility of Google Flu Trends.

Sasikiran Kandula, Jeffrey Shaman, Nicola Segata

doi:10.1371/journal.pcbi.1007258

Open Access

https://doi.org/10.1371/journal.pcbi.1007258

Journal: PLOS Computational Biology
Publication Date: Aug 2, 2019
Citations: 69
License type: CC BY 4.0

Affiliation: Columbia University

Abstract

Estimation of influenza-like illness (ILI) using search trends activity was intended to supplement traditional surveillance systems, and was a motivation behind the development of Google Flu Trends (GFT). However, several studies have previously reported large errors in GFT estimates of ILI in the US. Following recent release of time-stamped surveillance data, which better reflects real-time operational scenarios, we reanalyzed GFT errors. Using three data sources—GFT: an archive of weekly ILI estimates from Google Flu Trends; ILIf: fully-observed ILI rates from ILINet; and, ILIp: ILI rates available in real-time based on partial reporting—five influenza seasons were analyzed and mean square errors (MSE) of GFT and ILIp as estimates of ILIf were computed. To correct GFT errors, a random forest regression model was built with ILI and GFT rates from the previous three weeks as predictors. An overall reduction in error of 44% was observed and the errors of the corrected GFT are lower than those of ILIp. An 80% reduction in error during 2012/13, when GFT had large errors, shows that extreme failures of GFT could have been avoided. Using autoregressive integrated moving average (ARIMA) models, one- to four-week ahead forecasts were generated with two separate data streams: ILIp alone, and with both ILIp and corrected GFT. At all forecast targets and seasons, and for all but two regions, inclusion of GFT lowered MSE. Results from two alternative error measures, mean absolute error and mean absolute proportional error, were largely consistent with results from MSE. Taken together these findings provide an error profile of GFT in the US, establish strong evidence for the adoption of search trends based 'nowcasts' in influenza forecast systems, and encourage reevaluation of the utility of this data source in diverse domains.
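The error measures and the lagged-predictor setup named in the abstract can be illustrated with a minimal sketch. It is not the authors' code. The function names, the exact form of the mean absolute proportional error (absolute error divided by the fully observed rate), the ordering of the lagged columns, and the toy numbers are all assumptions made for illustration; the abstract states only that ILI and GFT rates from the previous three weeks were used as predictors.

```python
from statistics import fmean


def mse(estimates, truth):
    """Mean square error of estimates against fully observed rates."""
    return fmean((e - t) ** 2 for e, t in zip(estimates, truth))


def mae(estimates, truth):
    """Mean absolute error."""
    return fmean(abs(e - t) for e, t in zip(estimates, truth))


def mape(estimates, truth):
    """Mean absolute proportional error (assumed form: |e - t| / t)."""
    return fmean(abs(e - t) / t for e, t in zip(estimates, truth))


def percent_reduction(error_before, error_after):
    """Relative drop in error, in percent."""
    return 100.0 * (error_before - error_after) / error_before


def lagged_features(ili_obs, gft, target, n_lags=3):
    """Build one predictor row per week from the previous n_lags weeks.

    Row for week t: [ili_obs[t-1], ..., ili_obs[t-n_lags],
                     gft[t-1], ..., gft[t-n_lags]], paired with target[t].
    Rows like these could be passed to any regression model.
    """
    rows, targets = [], []
    for t in range(n_lags, len(target)):
        ili_lags = [ili_obs[t - k] for k in range(1, n_lags + 1)]
        gft_lags = [gft[t - k] for k in range(1, n_lags + 1)]
        rows.append(ili_lags + gft_lags)
        targets.append(target[t])
    return rows, targets


# Toy weekly rates (percent ILI); illustrative values, not data from the paper.
ili_f = [1.0, 2.0, 4.0, 2.0]   # fully observed rates
ili_p = [1.0, 2.5, 3.0, 2.0]   # rates available in real time
gft = [1.5, 3.0, 6.0, 3.0]     # search-based estimates

print("MSE of GFT vs ILIf:", mse(gft, ili_f))
print("MSE of ILIp vs ILIf:", mse(ili_p, ili_f))
X, y = lagged_features(ili_p, gft, ili_f)
print("Predictor rows:", X, "targets:", y)
```

In this setup, a correction model would be trained on `X` and `y` from earlier weeks, and its output would then be scored against ILIf with the same error functions to obtain a figure comparable to the reductions reported above.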

Highlights

Surveillance of seasonal influenza and other respiratory illnesses deservedly receives significant attention from public health agencies in the United States
Google Flu Trends (GFT) was proposed as a method to estimate influenza-like illness (ILI) in the general population and to be used in conjunction with traditional surveillance systems
Several previous studies have documented that GFT estimates were often overestimates of ILI

Summary

Introduction

Although Google has not offered reasons for the termination of GFT, one contributing factor could well have been its widely reported propensity to over-estimate ILI, which effectively turned it, in the public perception, from a poster child for the power and utility of big data into an example of its hubris [14,15,16,17,18,19,20]. In this paper, using newly available surveillance data, we revisit GFT estimates for locations in the US and show that its errors are less substantial than previously reported.

Similar Papers

Monitoring Influenza Activity in the United States: A Comparison of Traditional Surveillance Systems with Google Flu Trends
Justin R Ortiz ... Vernon Lee
PLoS ONE | VOL. 6 | 27 Apr 2011

Comparing Observed with Predicted Weekly Influenza-Like Illness Rates during the Winter Holiday Break, United States, 2004-2013.
Hongjiang Gao ... Dena L Schanzer
PLOS ONE | VOL. 10 | 09 Dec 2015

Improving Google Flu Trends estimates for the United States through transformation.
Leah J Martin ... Edward Goldstein
PLoS ONE | VOL. 9 | 31 Dec 2015

Relationship Between Baseline Influenza-like Illness Rates And Healthcare Settings
Dino Rumoro ... Gordon Trenholme
Online Journal of Public Health Informatics | VOL. 9 | 02 May 2017
