Abundant Topological Outliers in Social Media Data and Their Effect on Spatial Analysis.

Rene Westerholt, Bernd Resch, Alexander Zipf, Enrico Steiger

doi:10.1371/journal.pone.0162360


Open Access

https://doi.org/10.1371/journal.pone.0162360


Abstract

Twitter and related social media feeds have become valuable data sources for many fields of research. Numerous researchers have used social media posts for spatial analysis, since many posts contain explicit geographic locations. However, despite their widespread use in applied research, a thorough understanding of the underlying spatial characteristics of these data is still lacking. In this paper, we investigate how topological outliers influence the outcomes of spatial analyses of social media data. These outliers appear when different users contribute heterogeneous information about different phenomena simultaneously from similar locations. As a consequence, various messages representing different spatial phenomena are captured close to each other and are at risk of being falsely related in a spatial analysis. Our results reveal indications of corresponding spurious effects when analyzing Twitter data. Further, we show how the outliers distort the range of outcomes of spatial analysis methods. This has a significant influence on the power of spatial inferential techniques and, more generally, on the validity and interpretability of spatial analysis results. We further investigate in detail how the issues caused by topological outliers are composed. We find that multiple disturbing effects act simultaneously and that these are related to the geographic scales of the overlapping patterns involved. Our results show that at some scale configurations, the disturbances added through overlap are more severe than at others. Further, their behavior turns into a volatile and almost chaotic fluctuation when the scales of the involved patterns become too different. Overall, our results highlight the critical importance of thoroughly considering the specific characteristics of social media data when analyzing them spatially.
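The mechanism described in the abstract can be illustrated with a toy example. The sketch below is not the simulation used in the paper: the one-dimensional layout, the two synthetic patterns, the k-nearest-neighbour weights (k = 2) and the helper names `knn_weights` and `morans_i` are all assumptions made for illustration. It measures Moran's I for a smooth pattern on its own, and again after an unrelated pattern has been observed at interleaved locations.

```python
import numpy as np

def knn_weights(x, k=2):
    """Binary k-nearest-neighbour weights for 1-D positions (no self-links)."""
    x = np.asarray(x, dtype=float)
    dist = np.abs(x[:, None] - x[None, :])
    np.fill_diagonal(dist, np.inf)            # a point is never its own neighbour
    nearest = np.argsort(dist, axis=1)[:, :k]
    w = np.zeros((len(x), len(x)))
    w[np.arange(len(x))[:, None], nearest] = 1.0
    return w

def morans_i(values, w):
    """Global Moran's I for the given values and spatial weights matrix."""
    z = np.asarray(values, dtype=float) - np.mean(values)
    return len(z) / w.sum() * (z @ w @ z) / (z @ z)

# Pattern A (assumed): a smooth trend sampled at positions 0, 1, ..., 19.
x_a = np.arange(20, dtype=float)
v_a = x_a.copy()

# Pattern B (assumed): an unrelated phenomenon observed between the A locations.
x_b = x_a + 0.5
v_b = np.where(np.arange(20) % 2 == 0, 0.0, 19.0)

i_alone = morans_i(v_a, knn_weights(x_a))
i_mixed = morans_i(np.concatenate([v_a, v_b]),
                   knn_weights(np.concatenate([x_a, x_b])))
print(f"Moran's I, pattern A alone: {i_alone:.3f}")
print(f"Moran's I, A and B mixed:   {i_mixed:.3f}")
```

In this toy setting, the nearest neighbours of almost every observation belong to the other pattern once the two are mixed. The strong autocorrelation of pattern A therefore largely disappears from the global statistic, although pattern A itself is unchanged.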

Highlights

One aspect in the analysis of social phenomena is the search for spatial structures and patterns.
In this paper, we investigate how topological outliers caused by the abovementioned heterogeneities influence spatial analysis methodology in a general sense.
We outline some problematic covariation-based characteristics that emerge when analyzing Twitter messages with established spatial analysis methods (‘Indications from the Twitter dataset’). Afterwards, we investigate these characteristics within a simulated dataset, which allows us to control parameters such as spatial scale and attributes.

Summary

Methods

One of the datasets is a Twitter dataset consisting of georeferenced tweets. It was crawled through the publicly available Streaming API over a period of approximately one year. We only used explicit coordinates offered in the form of latitude-longitude tuples. These may include GPS-derived locations as well as positions determined by WiFi positioning techniques and check-ins (see Section ‘Indications from the Twitter dataset’ for further discussion of this point).

This heat map allows disaggregating the overall autocovariance into its constituent parts. The benefit of this approach is that, unlike with a covariogram or a correlogram, we aggregate neither by distance bands nor by random variables.
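The idea of disaggregating an overall autocovariance into its parts can be sketched as follows. This summary does not specify how the paper's heat map is constructed, so the code is only one possible reading: it keeps the weighted cross-product of every pair of observations in a matrix instead of summing them by distance band. The function name `autocovariance_parts`, the k-nearest-neighbour weights and the four example points are assumptions.

```python
import numpy as np

def autocovariance_parts(coords, values, k=2):
    """Return the pairwise terms w_ij * z_i * z_j and the weights matrix.

    Assumption: binary k-nearest-neighbour weights; the paper may use a
    different neighbourhood definition.
    """
    coords = np.asarray(coords, dtype=float)
    values = np.asarray(values, dtype=float)
    n = len(values)
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    np.fill_diagonal(dist, np.inf)            # exclude self-pairs
    nearest = np.argsort(dist, axis=1)[:, :k]
    weights = np.zeros((n, n))
    weights[np.arange(n)[:, None], nearest] = 1.0
    z = values - values.mean()                # deviations from the global mean
    parts = weights * np.outer(z, z)          # one cell per ordered pair (i, j)
    return parts, weights

# Hypothetical example: three nearby observations and one distant one.
coords = [[0, 0], [1, 0], [0, 1], [5, 5]]
values = [1.0, 2.0, 3.0, 10.0]
parts, weights = autocovariance_parts(coords, values)

# Summing all cells recovers a single global autocovariance figure.
overall = parts.sum() / weights.sum()
print(parts)
print(overall)
```

Each cell of `parts` can be drawn as one cell of a heat map, for example with matplotlib's `imshow`. Individual pairs that pull the global figure up or down then stay visible instead of being averaged away.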



Journal: PLOS ONE	Publication Date: Sep 9, 2016
Citations: 11	License type: CC BY 4.0


