Abstract

BackgroundInternet search data on health-related terms can reflect people’s concerns about their health status in near real time, and hence serve as a supplementary metric of disease characteristics. However, studies using internet search data to monitor and predict chronic diseases at a geographically finer state-level scale are sparse.ObjectiveThe aim of this study was to explore the associations of internet search volumes for lung cancer with published cancer incidence and mortality data in the United States.MethodsWe used Google relative search volumes, which represent the search frequency of specific search terms in Google. We performed cross-sectional analyses of the original and disease metrics at both national and state levels. A smoothed time series of relative search volumes was created to eliminate the effects of irregular changes on the search frequencies and obtain the long-term trends of search volumes for lung cancer at both the national and state levels. We also performed analyses of decomposed Google relative search volume data and disease metrics at the national and state levels.ResultsThe monthly trends of lung cancer-related internet hits were consistent with the trends of reported lung cancer rates at the national level. Ohio had the highest frequency for lung cancer-related search terms. At the state level, the relative search volume was significantly correlated with lung cancer incidence rates in 42 states, with correlation coefficients ranging from 0.58 in Virginia to 0.94 in Oregon. Relative search volume was also significantly correlated with mortality in 47 states, with correlation coefficients ranging from 0.58 in Oklahoma to 0.94 in North Carolina. Both the incidence and mortality rates of lung cancer were correlated with decomposed relative search volumes in all states excluding Vermont.ConclusionsInternet search behaviors could reflect public awareness of lung cancer. Research on internet search behaviors could be a novel and timely approach to monitor and estimate the prevalence, incidence, and mortality rates of a broader range of cancers and even more health issues.

Highlights

  • Cancer affects people at all socioeconomic levels and has become a worldwide public health problem

  • Our findings suggest that internet search volumes may reflect disease characteristics of lung cancer and provide an additional means of national- and state-level cancer surveillance in the United States

  • The search data are updated in real time, as mentioned above, the publication of cancer registration data usually lag behind data collection for several years; the latest incidence and mortality rates of lung cancer available for the present analysis were those published in 2015 by the Centers for Disease Control and Prevention (CDC)

Read more

Summary

Introduction

Cancer affects people at all socioeconomic levels and has become a worldwide public health problem. In the United States, cancer is the second leading cause of death, resulting in approximately 150,000 deaths per year [2,3]. The mortality rate of lung cancer is the highest in the United States. Current data on cancer incidence, mortality, and survival have been mainly collected by the US Centers for Disease Control and Prevention (CDC) and the National Cancer Institute (NCI). New methods in the era of big data are needed to help supplement current strategies and improve the monitoring of lung cancer. Internet search data on health-related terms can reflect people’s concerns about their health status in near real time, and serve as a supplementary metric of disease characteristics. Studies using internet search data to monitor and predict chronic diseases at a geographically finer state-level scale are sparse

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call