Abstract

Extracting geospatially rich knowledge from tweets is of utmost importance for location-based systems in emergency services to raise situational awareness about a given crisis-related incident, such as earthquakes, floods, car accidents, terrorist attacks, shooting attacks, etc. The problem is that the majority of tweets are not geotagged, so we need to resort to the messages in the search of geospatial evidence. In this context, we present LORE, a location-detection system for tweets that leverages the geographic database GeoNames together with linguistic knowledge through NLP techniques. One of the main contributions of this model is to capture fine-grained complex locative references, ranging from geopolitical entities and natural geographic references to points of interest and traffic ways. LORE outperforms state-of-the-art open-source location-extraction systems (i.e. Stanford NER, spaCy, NLTK and OpenNLP), achieving an unprecedented trade-off between precision and recall. Therefore, our model provides not only a quantitative advantage over other well-known systems in terms of performance but also a qualitative advantage in terms of the diversity and semantic granularity of the locative references extracted from the tweets.

Highlights

  • Twitter is one of the most widely used and popular microblogging and social media sites, as well as one of the most investigated in event detection, sentiment analysis and geolocation, among many other research areas (Murthy, 2018)

  • The problem is that geotagged tweets represent around 1% of tweets only (Middleton et al, 2014), which hinders the development of geolocation systems for Twitter

  • Named Entity Recognition (NER)-based approaches applied to microblogging services such as Twitter perform reasonably well when confronting the challenges presented by the noisy nature of tweets

Read more

Summary

Introduction

Twitter is one of the most widely used and popular microblogging and social media sites, as well as one of the most investigated in event detection, sentiment analysis and geolocation, among many other research areas (Murthy, 2018). Location detection should not be confused with other terms such as geocoding (Middleton et al, 2018) or geotagging (Gritta et al, 2018), which deal with the assignation of spatial coordinates to locative references after being disambiguated (Gritta et al, 2018) In this context, geoparsing (Leidner & Lieberman, 2011; Liu et al, 2014) usually consists of two phases, location detection and location disambiguation (Gelernter & Balaji, 2013; Purves et al, 2018; Wallgrün et al, 2018). According to Jurafsky & Martin (2020), there are three types of NER-based models:

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.