Twitter-Based Influenza Detection After Flu Peak via Tweets With Indirect Information: Text Mining Study.

Shoko Wakamiya,Yukiko Kawai,Eiji Aramaki

doi:10.2196/publichealth.8627

Shoko Wakamiya, Yukiko Kawai + Show 1 more

Open Access

https://doi.org/10.2196/publichealth.8627

Copy DOI

Journal: JMIR Public Health and Surveillance	Publication Date: Sep 25, 2018
Citations: 64	License type: cc-by

Affiliation: Kyoto Sangyo University

Abstract

BackgroundThe recent rise in popularity and scale of social networking services (SNSs) has resulted in an increasing need for SNS-based information extraction systems. A popular application of SNS data is health surveillance for predicting an outbreak of epidemics by detecting diseases from text messages posted on SNS platforms. Such applications share the following logic: they incorporate SNS users as social sensors. These social sensor–based approaches also share a common problem: SNS-based surveillance are much more reliable if sufficient numbers of users are active, and small or inactive populations produce inconsistent results.ObjectiveThis study proposes a novel approach to estimate the trend of patient numbers using indirect information covering both urban areas and rural areas within the posts.MethodsWe presented a TRAP model by embedding both direct information and indirect information. A collection of tweets spanning 3 years (7 million influenza-related tweets in Japanese) was used to evaluate the model. Both direct information and indirect information that mention other places were used. As indirect information is less reliable (too noisy or too old) than direct information, the indirect information data were not used directly and were considered as inhibiting direct information. For example, when indirect information appeared often, it was considered as signifying that everyone already had a known disease, leading to a small amount of direct information.ResultsThe estimation performance of our approach was evaluated using the correlation coefficient between the number of influenza cases as the gold standard values and the estimated values by the proposed models. The results revealed that the baseline model (BASELINE+NLP) shows .36 and that the proposed model (TRAP+NLP) improved the accuracy (.70, +.34 points).ConclusionsThe proposed approach by which the indirect information inhibits direct information exhibited improved estimation performance not only in rural cities but also in urban cities, which demonstrated the effectiveness of the proposed method consisting of a TRAP model and natural language processing (NLP) classification.

Highlights

BackgroundThe increased use of social networking platforms entails more widely shared personal information
This study handles only the location name as indirect information, but various expressions have been used in indirect messages
This paper proposed a novel approach that uses direct information and indirect information that mentions other places for disease epidemic prediction

Summary

Introduction

BackgroundThe increased use of social networking platforms entails more widely shared personal information. A popular application of SNS data is health surveillance for predicting an outbreak of epidemics by detecting diseases from text messages posted on SNS platforms. Such applications share the following logic: they incorporate SNS users as social sensors. A collection of tweets spanning 3 years (7 million influenza-related tweets in Japanese) was used to evaluate the model Both direct information and indirect information that mention other places were used. Conclusions: The proposed approach by which the indirect information inhibits direct information exhibited improved estimation performance in rural cities and in urban cities, which demonstrated the effectiveness of the proposed method consisting of a TRAP model and natural language processing (NLP) classification

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Twitter-Based Influenza Detection After Flu Peak via Tweets With Indirect Information: Text Mining Study.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: JMIR Public Health and Surveillance

Lead the way for us

Similar Papers

After the boom no one tweets
Shoko Wakamiya ... Eiji Aramaki
-
Shoko Wakamiya, et. al.Shoko Wakamiya ... Eiji Aramaki
17 Oct 2016
17 Oct 2016

Presentation modality and indirect performance information: effects on ratings, reactions, and memory.
Krista L Uggerslev ... Lorne M Sulsky
The Journal of applied psychology | VOL. 87
Krista L Uggerslev, et. al.Krista L Uggerslev ... Lorne M Sulsky
01 Jan 2002
The Journal of applied psychology | VOL. 87

Domestic pigs' (Sus scrofa domestica) use of direct and indirect visual and auditory cues in an object choice task.
Christian Nawroth ... Eberhard Von Borell
Animal Cognition | VOL. 18
Christian Nawroth, et. al.Christian Nawroth ... Eberhard Von Borell
04 Feb 2015
Animal Cognition | VOL. 18

Exclusion Performance in Dwarf Goats (Capra aegagrus hircus) and Sheep (Ovis orientalis aries)
Christian Nawroth ... Alan Mcelligott
PLoS ONE | VOL. 9
Christian Nawroth, et. al.Christian Nawroth ... Alan Mcelligott
02 Apr 2014
PLoS ONE | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Twitter-Based Influenza Detection After Flu Peak via Tweets With Indirect Information: Text Mining Study.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: JMIR Public Health and Surveillance