Offensive, aggressive, and hate speech analysis: From data-centric to human-centered approach

Jan Kocoń,Alicja Figas,Marcin Gruza,Daria Puchalska,Tomasz Kajdanowicz,Przemysław Kazienko

doi:10.1016/j.ipm.2021.102643

Abstract

Analysis of subjective texts like offensive content or hate speech is a great challenge, especially regarding annotation process. Most of current annotation procedures are aimed at achieving a high level of agreement in order to generate a high quality reference source. However, the annotation guidelines for subjective content may restrict the annotators’ freedom of decision making. Motivated by a moderate annotation agreement in offensive content datasets, we hypothesize that personalized approaches to offensive content identification should be in place. Thus, we propose two novel perspectives of perception: group-based and individual. Using demographics of annotators as well as embeddings of their previous decisions (annotated texts), we are able to train multimodal models (including transformer-based) adjusted to personal or community profiles. Based on the agreement of individuals and groups, we experimentally showed that annotator group agreeability strongly correlates with offensive content recognition quality. The proposed personalized approaches enabled us to create models adaptable to personal user beliefs rather than to agreed offensiveness understanding. Overall, our individualized approaches to offensive content classification outperform classic data-centric methods that generalize offensiveness perception and it refers to all six tested models. Additionally, we developed requirements for annotation procedures, personalization and content processing to make the solutions human-centered.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Information Processing & Management	Publication Date: Jun 3, 2021
Citations: 48	License type: cc-by

R Discovery Prime

R Discovery Prime

Offensive, aggressive, and hate speech analysis: From data-centric to human-centered approach

Abstract

Talk to us

Similar Papers

More From: Information Processing & Management

Lead the way for us

Similar Papers

Evaluating Machine Learning Techniques for Detecting Offensive and Hate Speech in South African Tweets
Oluwafemi Oriola ... Eduan Kotze
IEEE Access | VOL. 8
Oluwafemi Oriola, et. al.Oluwafemi Oriola ... Eduan Kotze
01 Jan 2020
IEEE Access | VOL. 8

Detecting Offensive Content on Social Media During Anti-Lockdown Protests in Michigan
Jihye Moon ... Swapna S Gokhale
-
Jihye Moon, et. al.Jihye Moon ... Swapna S Gokhale
01 Jul 2021
01 Jul 2021

Detecting Offensive Content on Twitter During Proud Boys Riots
Md Fahim ... Swapna S Gokhale
-
Md Fahim, et. al.Md Fahim ... Swapna S Gokhale
01 Dec 2021
01 Dec 2021

Detection of Offensive Language and ITS Severity for Low Resource Language
Ramsha Saeed ... Sadaf Abdul Rauf
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 22
Ramsha Saeed, et. al.Ramsha Saeed ... Sadaf Abdul Rauf
17 Jun 2023
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Offensive, aggressive, and hate speech analysis: From data-centric to human-centered approach

Abstract

Talk to us

Similar Papers

More From: Information Processing &amp; Management

More From: Information Processing & Management