Transfer learning for hate speech detection in social media

Lanqin Yuan, Tianyu Wang, Gabriela Ferraro, Hanna Suominen, Marian-Andrei Rizoiu

doi:10.1007/s42001-023-00224-9

Abstract

Today, the internet is an integral part of our daily lives, enabling people to be more connected than ever before. However, this greater connectivity and access to information increase exposure to harmful content, such as cyber-bullying and cyber-hatred. Models based on machine learning and natural language processing offer a way to make online platforms safer by identifying hate speech in web text autonomously. The main difficulty is annotating a sufficiently large number of examples to train these models. This paper uses a transfer learning technique to leverage two independent datasets jointly and build a single representation of hate speech. We build an interpretable two-dimensional visualization tool of the constructed hate speech representation, dubbed the Map of Hate, in which multiple datasets can be projected and comparatively analyzed. The hateful content is annotated differently across the two datasets (racist and sexist in one dataset, hateful and offensive in another). Nevertheless, the common representation successfully projects the harmless class of both datasets into the same space and can be used to uncover labeling errors (false positives). We also show that the joint representation boosts prediction performance when only a limited amount of supervision is available. These methods and insights hold the potential to make social media safer and to reduce the need to expose human moderators and annotators to distressing online messaging.

Journal: Journal of Computational Social Science
Publication Date: Oct 1, 2023
Citations: 9
License type: CC BY 4.0

Similar Papers

PROTOCOL: Online interventions for reducing hate speech and cyberhate: A systematic review.
Steven Windisch ... Susann Wiedlitzka
Campbell systematic reviews | VOL. 17
13 Jan 2021

A New Hate Speech Detection System based on Textual and Psychological Features
Fatimah Alkomah ... Xiaogang Ma
International Journal of Advanced Computer Science and Applications | VOL. 13
01 Jan 2021

Sinhala Hate Speech Detection in Social Media using Text Mining and Machine learning
H.M.S.T Sandaruwan ... S.A.S Lorensuhewa
01 Sep 2019

The Virality of Hate Speech on Social Media
Abdurahman Maarouf ... Nicolas Pröllochs
Proceedings of the ACM on Human-Computer Interaction | VOL. 8
17 Apr 2024
