Internet Texts Research Articles

The article is devoted to the gendered lexical unit which constitutes initial abbreviation derived from the idiom «razvedenka s pricepom» (divorsed woman with a child) and is actively used by Russian speaking Internet users to denote a female person on the basis of her marital status. The article describes the history of the appearance of the unit, its derivational and semantical characteristics, semantic and contextual connections and its functioning specifics in the Internet texts. The research is conducted on the material of the author's corpus of the contexts of use including 1 255 speech products published in between 2014 and 2023 in the communicative space of such social platforms as (425 units), vk.com (336 units), YouTube (190 units), Yandex.Dzen (173 units). The author applies the methods of quantative, semantic, conceptual, derivational, contextual and discursive analyses. The article specifies that the gendered abbreviation is derived from the linguocynical idiom, having been used in the male speech long before the spread of the Internet. The research reveals that the abbreviation use underlines the artifact conceptual metaphor generated by the initial phraseological combination. The divorced mother deanimization is regularly strengthened by the abrriviation's contextual environment. It reveals that 1 175 contexts (93,62 %) contain verbalized speaker's conception of financial aspect in sexual, romantic or family relationship caused by the actualization of consumer strategy in interpersonal ineteraction and by the ubiquitous consumeristic philosophy. It proves that the use of the innovation RSP is a feature of male Internet users' speech (982 contexts; 78,24 %) and is not typical for female speech (270 contexts; 21,51 %). In female speech the abbreviation carries out primarily the metalinguistic function and much less frequently nominative and evaluative functions. In the Internet texts produced by male persons the nomination not only promotes expressing the divorced mother with a child negative evaluation but also facilitates deconstruction, deaxiologization and desacralization of the concept «Mother». This allows us to qualify the abbreviation as a linguo-perversion which is a special kind of language trauma which represents a speech phenomenon generating the traditional symbol ambivalence and leading towards worldview destruction.

Contextual variables that capture the characteristics of delimited geographic or jurisdictional areas are vital for health and social research. However, obtaining data sets with contextual-level data can be challenging in the absence of monitoring systems or public census data. We describe and implement an 8-step method that combines web scraping, text mining, and spatial overlay analysis (WeTMS) to transform extensive text data from government websites into analyzable data sets containing contextual data for jurisdictional areas. This tutorial describes the method and provides resources for its application by health and social researchers. We used this method to create data sets of health assets aimed at enhancing older adults' social connections (eg, activities and resources such as walking groups and senior clubs) across the 374 health jurisdictions in Catalonia from 2015 to 2022. These assets are registered on a web-based government platform by local stakeholders from various health and nonhealth organizations as part of a national public health program. Steps 1 to 3 involved defining the variables of interest, identifying data sources, and using Python to extract information from 50,000 websites linked to the platform. Steps 4 to 6 comprised preprocessing the scraped text, defining new variables to classify health assets based on social connection constructs, analyzing word frequencies in titles and descriptions of the assets, creating topic-specific dictionaries, implementing a rule-based classifier in R, and verifying the results. Steps 7 and 8 integrate the spatial overlay analysis to determine the geographic location of each asset. We conducted a descriptive analysis of the data sets to report the characteristics of the assets identified and the patterns of asset registrations across areas. We identified and extracted data from 17,305 websites describing health assets. The titles and descriptions of the activities and resources contained 12,560 and 7301 unique words, respectively. After applying our classifier and spatial analysis algorithm, we generated 2 data sets containing 9546 health assets (5022 activities and 4524 resources) with the potential to enhance social connections among older adults. Stakeholders from 318 health jurisdictions registered identified assets on the platform between July 2015 and December 2022. The agreement rate between the classification algorithm and verified data sets ranged from 62.02% to 99.47% across variables. Leisure and skill development activities were the most prevalent (1844/5022, 36.72%). Leisure and cultural associations, such as social clubs for older adults, were the most common resources (878/4524, 19.41%). Health asset registration varied across areas, ranging between 0 and 263 activities and 0 and 265 resources. The sequential use of WeTMS offers a robust method for generating data sets containing contextual-level variables from internet text data. This study can guide health and social researchers in efficiently generating ready-to-analyze data sets containing contextual variables.

Internet Texts Research Articles

Related Topics

Articles published on Internet Texts

Embers of autoregression show how large language models are shaped by the problem they are trained to solve

Новые формы прецедентного текста в современной интернет-коммуникации

THE POETIC NATURE OF INTERNET-TEXTS AS A NATIONAL CHARACTER TRAIT

Multi-word expressions for Russian L2 learners: corpora-based selection with expert verification

Russian Logistics and Supply Chain Management: Challenges and Relevant Solutions

Образные единицы в коммуникативных практиках медиатизированного политического дискурса

The female nomination on the basis of marital status in Russian Internet texts: functional and linguo-ecological aspects

Extracting intersectional stereotypes from embeddings: Developing and validating the Flexible Intersectional Stereotype Extraction procedure.

Investor sentiment and stock returns: New evidence from Chinese carbon-neutral stock markets based on multi-source data

The digital-mediated extensive reading on English Language learning of agriculture students

ChatGPT offers an editorial on the opportunities for chatbots in dermatologic research and patient care.

ChatGPT-A Generative Pre-Trained Transformer

Generating Contextual Variables From Web-Based Data for Health Research: Tutorial on Web Scraping, Text Mining, and Spatial Overlay Analysis.

How Generative AI Was Mentioned in Social Media and Academic Field? A Text Mining Based on Internet Text Data

Exploring the Role of Chinese Language and Literature in the Transmission of Traditional Culture by Combining the Method of Internet Text Analysis

Discursive formulas of response in Modern Greek

Analyzing Sentiment with Self-Organizing Map and Long Short-Term Memory Algorithms

Metaphor as Invective in the Genre of Internet Commentary: A Focus on German Political Discourse

Network text and problems of its linguo-expert research (results of the round table "Linguistic expert research on conflict in Internet communication")

Конфликт интерпретаций текста как следствие когнитивного диссонанса участников интернет-обсуждений

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Internet Texts Research Articles

Related Topics

Articles published on Internet Texts

Embers of autoregression show how large language models are shaped by the problem they are trained to solve

Новые формы прецедентного текста в современной интернет-коммуникации

THE POETIC NATURE OF INTERNET-TEXTS AS A NATIONAL CHARACTER TRAIT

Multi-word expressions for Russian L2 learners: corpora-based selection with expert verification

Russian Logistics and Supply Chain Management: Challenges and Relevant Solutions

Образные единицы в коммуникативных практиках медиатизированного политического дискурса

The female nomination on the basis of marital status in Russian Internet texts: functional and linguo-ecological aspects

Extracting intersectional stereotypes from embeddings: Developing and validating the Flexible Intersectional Stereotype Extraction procedure.

Investor sentiment and stock returns: New evidence from Chinese carbon-neutral stock markets based on multi-source data

The digital-mediated extensive reading on English Language learning of agriculture students

ChatGPT offers an editorial on the opportunities for chatbots in dermatologic research and patient care.

ChatGPT-A Generative Pre-Trained Transformer

Generating Contextual Variables From Web-Based Data for Health Research: Tutorial on Web Scraping, Text Mining, and Spatial Overlay Analysis.

How Generative AI Was Mentioned in Social Media and Academic Field? A Text Mining Based on Internet Text Data

Exploring the Role of Chinese Language and Literature in the Transmission of Traditional Culture by Combining the Method of Internet Text Analysis

Discursive formulas of response in Modern Greek

Analyzing Sentiment with Self-Organizing Map and Long Short-Term Memory Algorithms

Metaphor as Invective in the Genre of Internet Commentary: A Focus on German Political Discourse

Network text and problems of its linguo-expert research (results of the round table "Linguistic expert research on conflict in Internet communication")

Конфликт интерпретаций текста как следствие когнитивного диссонанса участников интернет-обсуждений