AnnoBERT: Effectively Representing Multiple Annotators’ Label Choices to Improve Hate Speech Detection

Wenjie Yin, Aiqi Jiang, Arkaitz Zubiaga, Vibhor Agarwal, Nishanth Sastry

doi:10.1609/icwsm.v17i1.22198

Abstract

Supervised machine learning approaches often rely on a "ground truth" label. However, obtaining one label through majority voting ignores important subjectivity information in tasks such as hate speech detection. Existing neural network models principally regard labels as categorical variables and ignore the semantic information in diverse label texts. In this paper, we propose AnnoBERT, a first-of-its-kind architecture that integrates annotator characteristics and label text with a transformer-based model to detect hate speech. AnnoBERT builds a unique representation for each annotator, based on that annotator's characteristics, via Collaborative Topic Regression (CTR), and integrates label text to enrich textual representations. During training, the model associates annotators with their label choices given a piece of text; during evaluation, when label information is not available, the model predicts the aggregated label given by the participating annotators by utilising the learnt association. The proposed approach shows an advantage in detecting hate speech, especially in the minority class and in edge cases with annotator disagreement. The improvement in overall performance is largest when the dataset is more label-imbalanced, suggesting practical value in identifying real-world hate speech, since the volume of hate speech in the wild on social media is extremely small compared with normal (non-hate) speech. Through ablation studies, we show the relative contributions of annotator embeddings and label text to model performance, and we test a range of alternative annotator embeddings and label text combinations.
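
To make the abstract's point about majority voting concrete, the following is a minimal Python sketch and not the authors' implementation. The post IDs, annotator IDs, label strings, and the helper names `majority_vote` and `agreement` are all hypothetical. It shows that two posts can receive the same aggregated label even though annotators fully agreed on one and disagreed on the other, which is the subjectivity information a single "ground truth" label discards.

```python
from collections import Counter

def majority_vote(labels):
    """Return the most frequent label; ties are broken by sorted label order."""
    counts = Counter(labels)
    top = max(counts.values())
    return sorted(label for label, n in counts.items() if n == top)[0]

def agreement(labels):
    """Return the fraction of annotators who chose the most frequent label."""
    counts = Counter(labels)
    return max(counts.values()) / len(labels)

# Hypothetical annotations: post id -> {annotator id: chosen label text}.
annotations = {
    "post_1": {"a1": "hate", "a2": "hate", "a3": "hate"},
    "post_2": {"a1": "hate", "a2": "not hate", "a3": "hate"},
}

# Aggregated label plus the agreement level that the aggregation hides.
summary = {}
for post_id, choices in annotations.items():
    labels = list(choices.values())
    summary[post_id] = (majority_vote(labels), agreement(labels))

for post_id, (label, agree) in summary.items():
    print(f"{post_id}: majority={label!r}, agreement={agree:.2f}")
```

Both hypothetical posts aggregate to the same label, but only the per-annotator records reveal that the second one is an edge case with disagreement.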

Journal: Proceedings of the International AAAI Conference on Web and Social Media
Publication Date: Jun 2, 2023
Citations: 8

Similar Papers

Sinhala Hate Speech Detection in Social Media using Text Mining and Machine learning
H.M.S.T Sandaruwan ... S.A.S Lorensuhewa
01 Sep 2019

Evaluating Machine Learning Techniques for Detecting Offensive and Hate Speech in South African Tweets
Oluwafemi Oriola ... Eduan Kotze
IEEE Access | VOL. 8
01 Jan 2020

Hate speech and offensive language detection in Dravidian languages using deep ensemble framework
Pradeep Kumar Roy ... Chinnaudayar Navaneethakrishnan Subalalitha
Computer Speech & Language | VOL. 75
05 Apr 2022

Emotionally Informed Hate Speech Detection: A Multi-target Perspective
Patricia Chiril ... Viviana Patti
Cognitive Computation | VOL. 14
28 Jun 2021
