Improving Zero-shot Cross-lingual Hate Speech Detection with pseudo-label fine-tuning of Transformer Language Models

Haris Bin Zia, Ignacio Castro, Arkaitz Zubiaga, Gareth Tyson

doi:10.48448/kxfk-4n79

Abstract

Hate speech has proliferated on social media platforms in recent years. Although it has been the focus of many studies, most work has concentrated on a single language, generally English. Low-resource languages have been neglected due to the dearth of labeled resources. These languages, however, represent an important portion of the data due to the multilingual nature of social media. This work presents a novel zero-shot, cross-lingual transfer learning pipeline based on pseudo-label fine-tuning of Transformer Language Models for automatic hate speech detection. We employ our pipeline on benchmark datasets covering English (source) and 6 different non-English (target) languages written in 3 different scripts. Our pipeline achieves an average improvement of 7.6% (in terms of macro-F1) over previous zero-shot, cross-lingual models. This demonstrates the feasibility of high-accuracy automatic hate speech detection for low-resource languages. We release our code and models at https://github.com/.
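The abstract does not spell out the pipeline's steps, so the following is only a rough sketch of the two ideas it names: pseudo-labelling unlabeled target-language text with a source-trained model, and scoring with macro-F1. The function names, the `predict_proba` callable, and the confidence threshold are illustrative assumptions, not details taken from the paper.

```python
from typing import Callable, List, Sequence, Tuple


def macro_f1(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Unweighted mean of per-class F1 scores (standard macro-F1)."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != label and p == label)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == label and p != label)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never appears.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


def select_pseudo_labels(
    texts: Sequence[str],
    predict_proba: Callable[[str], Sequence[float]],  # hypothetical model hook
    threshold: float = 0.9,  # assumed cut-off, not from the paper
) -> List[Tuple[str, int]]:
    """Label target-language texts with a source-trained model.

    Only predictions whose top class probability reaches `threshold`
    are kept; the resulting (text, label) pairs could then be used as
    training data for a further round of fine-tuning.
    """
    selected = []
    for text in texts:
        probs = predict_proba(text)
        label = max(range(len(probs)), key=lambda i: probs[i])
        if probs[label] >= threshold:
            selected.append((text, label))
    return selected
```

In practice `predict_proba` would wrap a multilingual Transformer fine-tuned on the English source data; here it is left as a plain callable so the sketch stays independent of any particular library.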

Similar Papers

Improving Zero-Shot Cross-Lingual Hate Speech Detection with Pseudo-Label Fine-Tuning of Transformer Language Models
Haris Bin Zia ... Ignacio Castro
Proceedings of the International AAAI Conference on Web and Social Media | VOL. 16
31 May 2022

Combating hate speech using an adaptive ensemble learning model with a case study on COVID-19
Shivang Agarwal ... C Ravindranath Chowdary
Expert Systems with Applications | VOL. 185
27 Jul 2021

Accelerating automatic hate speech detection using parallelized ensemble learning models
Shivang Agarwal ... C Ravindranath Chowdary
Expert Systems with Applications | VOL. 230
05 Jun 2023

Vulnerable community identification using hate speech detection on social media
Zewdie Mossie ... Jenq-Haur Wang
Information Processing & Management | VOL. 57
23 Jul 2019
