Abstract

Automated text analysis methods have made it possible to classify large corpora of text by measures such as frames and tonality, and they are increasingly popular in social, political, and psychological science. These methods often demand a training dataset of sufficient size to generate accurate models that can be applied to unseen texts. In practice, however, there are no clear recommendations about how large training samples should be. This issue becomes especially acute when the distribution of categories in the texts is skewed and when researchers cannot afford large samples of annotated texts. Leveraging the case of support for democracy, we provide a guide to help researchers navigate decisions when producing measures of tonality and frames from a small sample of annotated social media posts. We find that supervised machine learning algorithms outperform dictionaries for tonality classification tasks. However, custom dictionaries are useful complements to these algorithms when identifying latent dimensions of democracy in social media messages, especially when the construction of these dictionaries is guided by word embedding techniques and human validation. We therefore provide easily implementable recommendations to increase estimation accuracy under non-optimal conditions.
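The embedding-guided dictionary construction mentioned above can be illustrated with a minimal sketch. The example below assumes gensim's Word2Vec implementation; the corpus, seed terms, and model parameters are illustrative placeholders, not the authors' actual data or settings. The idea is to train embeddings on the posts, retrieve nearest neighbors of a small set of seed words as candidate dictionary entries, and then pass those candidates to human coders for validation.

```python
# Minimal sketch of embedding-guided dictionary expansion.
# Corpus, seed words, and hyperparameters below are hypothetical placeholders.
from gensim.models import Word2Vec

# Hypothetical tokenized corpus: one list of tokens per social media post.
posts = [
    ["democracy", "needs", "free", "elections"],
    ["citizens", "demand", "accountability", "and", "rights"],
    # ... more annotated posts
]

# Train embeddings on the corpus (parameters are placeholders, not tuned values).
model = Word2Vec(sentences=posts, vector_size=100, window=5, min_count=1, seed=42)

# Seed terms for a hypothetical democracy dimension.
seeds = ["elections", "rights"]

# Retrieve nearest neighbors as candidate dictionary entries;
# these candidates would then be filtered by human validation.
candidates = model.wv.most_similar(positive=seeds, topn=20)
for word, similarity in candidates:
    print(f"{word}\t{similarity:.3f}")
```

In this sketch the similarity ranking only proposes candidates; the human validation step described in the abstract decides which terms enter the final dictionary.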
