Abstract

Social media platforms represent a deep resource for academic research and a wide range of untapped possibilities for linguists (D'ARCY; YOUNG, 2012). This rapidly developing field presents various ethical issues and unique challenges regarding methods to retrieve and analyze data. This tutorial provides a straightforward guide to harvesting and tidying Twitter data, focused mainly on the Tweets' text, by using the R programming language (R CORE TEAM, 2020) via Twitter's APIs. The R code was developed in Adams (2020), based on the rtweet package (KEARNEY, 2018), and successfully resulted in a script for corpora compilation. In this tutorial, we discuss limitations, problems, and solutions in our framework for conducting ethical research on this social networking site. Our ethical concerns go beyond what we "agree to" in terms of use and privacy policies, that is, we argue that their content does not contemplate all the concerns researchers need to attend to. Additionally, our aim is to show that using Twitter as a data source does not require advanced computational skills.

Highlights

  • The realm of social media presents a wide range of possibilities for linguistic research, which raises unique methodological challenges and ethical issues (D’ARCY; YOUNG, 2012)

  • Aside from corpora compilation, the R programming language is a free software environment that can be used for several computational tasks, such as statistical computing, graphics, among others

  • This guide is not an introduction to R for linguists nor to data science or to tidyverse, it intends to show that collecting data via Twitter application programming interface (API) is not as daunting and does not require advanced computational skills as it can initially seem

Read more

Summary

A GUIDE ON EXTRACTING AND TIDYING TWEETS WITH R

Julia Bahia ADAMS Instituto de Estudos da Linguagem – Universidade Estadual de Campinas (UNICAMP). Carlos Augusto Jardim CHIARELLI Faculdade de Engenharia Mecânica – Universidade Estadual de Campinas (UNICAMP). Conceptualização, Metodologia, Software, Escrita – Rascunho Original e Escrita – Análise e Edição. HOW TO CITE ADAMS, J.B.; CHIARELLI, C.A.J. A Guide on Extracting and Tidying Tweets with R.

INTRODUCTION
DATA HARVESTING
Findings
ETHICS OF SOCIAL MEDIA RESEARCH

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.