Abstract
ABSTRACTEarly detection of influenza outbreaks is one of the key priorities on a national level for preparedness and planning. This study presents the design and implementation of Fluwitter, which is a spatio-temporal web-based prototype framework for pseudo real-time detection of influenza outbreaks from Twitter. Specifically, the framework integrates PostgreSQL database server with PostGIS spatial extension, Twitter streaming client, pre-processor, tagger and similarity calculator for semantic information extraction (IE). The IE of tagged terms is supported by Natural Language Processing (NLP) techniques, DBpediaSpotlight and WordNet Similarity for Java (WS4J), while data analytics, visualization, and mapping are supported by GeoServer and other GIS Free Open Source Software (FOSS). The prototype was calibrated to maximize detection of influenza using rules developed from ontology-based semantic similarity scores. The Twitter-generated influenza cases were validated by weekly hospitalization records issued by Ohio Department of Health (ODH). The optimized rule produced a final F-measure value of 0.72 and accuracy (ACC) value of 94.4%. The validation suggested the existence of moderate correlations for the beginning of the time period Southeast region (r = 0.52), the Northwestern region (r = 0.38), and the Central region (r = 0.33) and weak correlations for the entire time period. The potential strengths and benefits of the prototype are shown through spatio-temporal assessment and visualization of influenza potential in Ohio.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.