Predicting the Relevance of Social Media Posts Based on Linguistic Features and Journalistic Criteria

Alexandre Pinto,Álvaro Figueira,Ana Oliveira Alves,Hugo Gonçalo Oliveira

doi:10.1007/s00354-017-0015-1

Abstract

An overwhelming quantity of messages is posted in social networks every minute. To make the utilization of these platforms more productive, it is imperative to filter out information that is irrelevant to the general audience, such as private messages, personal opinions or well-known facts. This work is focused on the automatic classification of public social text according to its potential relevance, from a journalistic point of view, hopefully improving the overall experience of using a social network. Our experiments were based on a set of posts with several criteria, including the journalistic relevance, assessed by human judges. To predict the latter, we rely exclusively on linguistic features, extracted by Natural Language Processing tools, regardless the author of the message and its profile information. In our first approach, different classifiers and feature engineering methods were used to predict relevance directly from the selected features. In a second approach, relevance was predicted indirectly, based on an ensemble of classifiers for other key criteria when defining relevance—controversy, interestingness, meaningfulness, novelty, reliability and scope—also in the dataset. The first approach achieved a F 1-score of 0.76 and an Area under the ROC curve (AUC) of 0.63. But the best results were achieved by the second approach, with the best learned model achieving a F 1-score of 0.84 with an AUC of 0.78. This confirmed that journalistic relevance can indeed be predicted by the combination of the selected criteria, and that linguistic features can be exploited to classify the latter.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Predicting the Relevance of Social Media Posts Based on Linguistic Features and Journalistic Criteria

Abstract

Talk to us

Similar Papers

More From: New Generation Computing

Lead the way for us

Journal: New Generation Computing	Publication Date: Apr 25, 2017
Citations: 4

Similar Papers

Anthropometric indicators as discriminators of high body fat in children and adolescents with HIV: comparison with reference methods.
Carlos A Souza Alves Jr ... Luiz R Augustemak De Lima
Minerva pediatrics | VOL. 75
Carlos A Souza Alves Jr, et. al.Carlos A Souza Alves Jr ... Luiz R Augustemak De Lima
01 Nov 2023
Minerva pediatrics | VOL. 75

Natural Language Processing and the Promise of Big Data: Small Step Forward, but Many Miles to Go.
Thomas M Maddox ... Michael A Matheny
Circulation. Cardiovascular quality and outcomes | VOL. 8
Thomas M Maddox, et. al.Thomas M Maddox ... Michael A Matheny
18 Aug 2015
Circulation. Cardiovascular quality and outcomes | VOL. 8

B-type natriuretic peptide informativeness in myocardial revascularization with cardio-pulmonary bypass
I A Kozlov ... V Yu Rybakov
Messenger of ANESTHESIOLOGY AND RESUSCITATION | VOL. 21
I A Kozlov, et. al.I A Kozlov ... V Yu Rybakov
25 Aug 2024
Messenger of ANESTHESIOLOGY AND RESUSCITATION | VOL. 21

Optimising speech‐testing to predict prodromal Alzheimer’s disease: head‐to‐head comparison study of tasks and analysis methods
Udeepa Meepegama ... Jack Weston
Alzheimer's & Dementia | VOL. 19
Udeepa Meepegama, et. al.Udeepa Meepegama ... Jack Weston
01 Dec 2023
Alzheimer's & Dementia | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Predicting the Relevance of Social Media Posts Based on Linguistic Features and Journalistic Criteria

Abstract

Talk to us

Similar Papers

More From: New Generation Computing