Freshman or Fresher? Quantifying the Geographic Variation of Language in Online Social Media

Vivek Kulkarni, Steven Skiena, Bryan Perozzi

doi:10.1609/icwsm.v10i1.14798

Abstract

In this paper we present a new computational technique to detect and analyze statistically significant geographic variation in language. While previous approaches have primarily focused on lexical variation between regions, our method identifies words that demonstrate semantic and syntactic variation as well. We extend recently developed techniques for neural language models to learn word representations which capture differing semantics across geographical regions. In order to quantify this variation and ensure robust detection of true regional differences, we formulate a null model to determine whether observed changes are statistically significant. Our method is the first such approach to explicitly account for random variation due to chance while detecting regional variation in word meaning. To validate our model, we study and analyze two different massive online data sets: millions of tweets from Twitter as well as millions of phrases contained in the Google Book Ngrams. Our analysis reveals interesting facets of language change across countries.
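
The abstract does not spell out how the null model is built, so the following is only a minimal sketch of one common way to test whether a regional difference exceeds chance: a label-permutation test. It assumes, hypothetically, that each occurrence of a word is already represented by a small numeric vector per region. The function names, the cosine distance, and the toy data are illustrative assumptions, not the authors' implementation.

```python
import math
import random


def mean_vector(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def cosine_distance(u, v):
    """1 - cosine similarity; assumes neither vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def permutation_p_value(region_a, region_b, n_permutations=1000, seed=0):
    """Hypothetical null model: shuffle region labels and recompute the distance.

    Returns the observed distance between the regional mean vectors and the
    fraction of shuffles whose distance is at least as large (with the usual
    +1 correction so the p-value is never exactly zero).
    """
    rng = random.Random(seed)
    observed = cosine_distance(mean_vector(region_a), mean_vector(region_b))
    pooled = list(region_a) + list(region_b)
    n_a = len(region_a)
    at_least_as_extreme = 0
    for _ in range(n_permutations):
        rng.shuffle(pooled)  # break any link between vector and region
        null_distance = cosine_distance(
            mean_vector(pooled[:n_a]), mean_vector(pooled[n_a:])
        )
        if null_distance >= observed:
            at_least_as_extreme += 1
    p_value = (at_least_as_extreme + 1) / (n_permutations + 1)
    return observed, p_value


# Toy data (made up): the word's usage vectors point in different directions
# in the two regions, so the observed distance should be hard to get by chance.
region_a = [[1.0, 0.01 * i] for i in range(10)]
region_b = [[0.01 * i, 1.0] for i in range(10)]
observed, p_value = permutation_p_value(region_a, region_b)
print(f"observed distance = {observed:.3f}, p-value = {p_value:.4f}")
```

A small p-value here means that random relabeling of the regions rarely produces a gap as large as the observed one, which is the general idea behind separating true regional variation from chance.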

Journal: Proceedings of the International AAAI Conference on Web and Social Media
Publication Date: Aug 4, 2021
Citations: 31
