Language Statistics at Different Spatial, Temporal, and Grammatical Scales.

Fernanda Sánchez-Puig,Rogelio Lozano-Aranda,Dante Pérez-Méndez,Ewan Colman,Alfredo J Morales-Guzmán,Pedro Juan Rivera Torres,Carlos Pineda,Carlos Gershenson

doi:10.3390/e26090734

Abstract

In recent decades, the field of statistical linguistics has made significant strides, which have been fueled by the availability of data. Leveraging Twitter data, this paper explores the English and Spanish languages, investigating their rank diversity across different scales: temporal intervals (ranging from 3 to 96 h), spatial radii (spanning 3 km to over 3000 km), and grammatical word ngrams (ranging from 1-grams to 5-grams). The analysis focuses on word ngrams, examining a time period of 1 year (2014) and eight different countries. Our findings highlight the relevance of all three scales with the most substantial changes observed at the grammatical level. Specifically, at the monogram level, rank diversity curves exhibit remarkable similarity across languages, countries, and temporal or spatial scales. However, as the grammatical scale expands, variations in rank diversity become more pronounced and influenced by temporal, spatial, linguistic, and national factors. Additionally, we investigate the statistical characteristics of Twitter-specific tokens, including emojis, hashtags, and user mentions, revealing a sigmoid pattern in their rank diversity function. These insights contribute to quantifying universal language statistics while also identifying potential sources of variation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Language Statistics at Different Spatial, Temporal, and Grammatical Scales.

Abstract

Talk to us

Similar Papers

More From: Entropy (Basel, Switzerland)

Lead the way for us

Journal: Entropy (Basel, Switzerland)	Publication Date: Aug 29, 2024
License type: cc-by

Similar Papers

Relationships between spatial and temporal variabilities in airborne metal distributions in Won Ju City, Korea
Ki-Hyun Kim
Environment International | VOL. 29
Ki-Hyun KimKi-Hyun Kim
15 Mar 2003
Environment International | VOL. 29

Ecological sexual segregation in fallow deer (Dama dama): a multispatial and multitemporal approach
Simone Ciuti ... Marco Apollonio
Behavioral Ecology and Sociobiology | VOL. 62
Simone Ciuti, et. al.Simone Ciuti ... Marco Apollonio
28 May 2008
Behavioral Ecology and Sociobiology | VOL. 62

Effect of spatial and temporal scales on habitat suitability modeling: A case study of Ommastrephes bartramii in the northwest pacific ocean
Caixia Gong ... Xinjun Chen
Journal of Ocean University of China | VOL. 13
Caixia Gong, et. al.Caixia Gong ... Xinjun Chen
22 Oct 2014
Journal of Ocean University of China | VOL. 13

森林水源涵养功能的多尺度内涵、过程及计量方法
王晓学 Wang Xiaoxue ... 景峰 Jing Feng
Acta Ecologica Sinica | VOL. 33
王晓学 Wang Xiaoxue, et. al.王晓学 Wang Xiaoxue ... 景峰 Jing Feng
01 Jan 2013
Acta Ecologica Sinica | VOL. 33

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Language Statistics at Different Spatial, Temporal, and Grammatical Scales.

Abstract

Talk to us

Similar Papers

More From: Entropy (Basel, Switzerland)