Automatic Hate Speech Detection Research Articles

Hate speech and harassment are widespread in online communication, due to users' freedom and anonymity and the lack of regulation provided by social media platforms. Hate speech is topically focused (misogyny, sexism, racism, xenophobia, homophobia, etc.), and each specific manifestation of hate speech targets different vulnerable groups based on characteristics such as gender (misogyny, sexism), ethnicity, race, religion (xenophobia, racism, Islamophobia), sexual orientation (homophobia), and so on. Most automatic hate speech detection approaches cast the problem as a binary classification task without addressing either the topical focus or the target-oriented nature of hate speech. In this paper, we propose to tackle, for the first time, hate speech detection from a multi-target perspective. We leverage manually annotated datasets to investigate the problem of transferring knowledge across datasets with different topical focuses and targets. Our contribution is threefold: (1) we explore the ability of hate speech detection models to capture common properties from topic-generic datasets and transfer this knowledge to recognize specific manifestations of hate speech; (2) we experiment with the development of models to detect both topics (racism, xenophobia, sexism, misogyny) and hate speech targets, going beyond standard binary classification, to investigate how to detect hate speech at a finer level of granularity and how to transfer knowledge across different topics and targets; and (3) we study the impact of affective knowledge encoded in sentic computing resources (SenticNet, EmoSenticNet) and in semantically structured hate lexicons (HurtLex) in determining specific manifestations of hate speech. We experimented with different neural models, including multitask approaches.
Our study shows that: (1) training a model on a combination of training sets from several topic-specific datasets is more effective than training a model on a topic-generic dataset; (2) the multi-task approach outperforms a single-task model when detecting both the hatefulness of a tweet and its topical focus in the context of a multi-label classification approach; and (3) the models incorporating EmoSenticNet emotions, the first-level emotions of SenticNet, a blend of SenticNet and EmoSenticNet emotions, or affective features based on HurtLex obtained the best results. Our results demonstrate that multi-target hate speech detection from existing datasets is feasible, which is a first step towards hate speech detection for a specific topic/target when dedicated annotated data are missing. Moreover, we show that domain-independent affective knowledge, injected into our models, helps finer-grained hate speech detection.
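The multi-task setup described above, where one model predicts both the hatefulness of a tweet and its topical focus, can be illustrated with a joint training objective. The following is a minimal sketch only: the abstract does not state which loss the authors used, so the weighted sum of binary cross-entropy terms, the function names, and the `topic_weight` parameter are all assumptions made for illustration.

```python
import math

# Topics named in the abstract; treating each as an independent 0/1 label
# is an assumption of this sketch (multi-label setting).
TOPICS = ["racism", "xenophobia", "sexism", "misogyny"]


def binary_cross_entropy(prob, label, eps=1e-12):
    """Cross-entropy for one predicted probability and one 0/1 label."""
    prob = min(max(prob, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -(label * math.log(prob) + (1 - label) * math.log(1.0 - prob))


def multitask_loss(hate_prob, topic_probs, hate_label, topic_labels, topic_weight=1.0):
    """Hypothetical joint loss: hatefulness term plus weighted mean topic term."""
    hate_term = binary_cross_entropy(hate_prob, hate_label)
    topic_term = sum(
        binary_cross_entropy(topic_probs[t], topic_labels[t]) for t in TOPICS
    ) / len(TOPICS)
    return hate_term + topic_weight * topic_term


if __name__ == "__main__":
    # An uninformative model (probability 0.5 everywhere) on a hateful tweet
    # with no topic label set.
    uncertain = multitask_loss(
        0.5, {t: 0.5 for t in TOPICS}, 1, {t: 0 for t in TOPICS}
    )
    print(round(uncertain, 4))
```

In a neural model, both terms would typically be computed from heads that share an encoder, so gradients from the topic task also shape the representation used for hatefulness; that sharing is what a single-task model lacks.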

Hate speech is an increasingly important societal issue in the era of digital communication. Hateful expressions often make use of figurative language and, although they represent, in some sense, the dark side of language, they are also often prime examples of creative use of language. While hate speech is a global phenomenon, current studies on automatic hate speech detection are typically framed in a monolingual setting. In this work, we explore hate speech detection in low-resource languages by transferring knowledge from a resource-rich language, English, in a zero-shot learning fashion. We experiment with traditional and recent neural architectures, and propose two joint-learning models, using different multilingual language representations to transfer knowledge between pairs of languages. We also evaluate the impact of additional knowledge in our experiments by incorporating information from a multilingual lexicon of abusive words. The results show that our joint-learning models achieve the best performance on most languages. However, a simple approach that uses machine translation and a pre-trained English language model achieves robust performance. In contrast, Multilingual BERT fails to obtain good performance in cross-lingual hate speech detection. We also found experimentally that the external knowledge from a multilingual abusive lexicon is able to improve the models’ performance, specifically in detecting the positive class. The results of our experimental evaluation highlight a number of challenges and issues in this particular task. One of the main challenges is related to the issue of current benchmarks for hate speech detection, in particular how bias related to the topical focus in the datasets influences the classification performance. The insufficient ability of current multilingual language models to transfer knowledge between languages in the specific hate speech detection task also remains an open problem.
However, our experimental evaluation and our qualitative analysis show how the explicit integration of linguistic knowledge from a structured abusive language lexicon helps to alleviate this issue.
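One way to picture the knowledge injection mentioned above is a lexicon that maps words from several languages onto shared categories, so that texts in different languages yield comparable feature vectors. The sketch below is illustrative only: the toy lexicon entries, the category names, and the `lexicon_features` function are hypothetical and are not taken from the lexicon or the models used in the study.

```python
from collections import Counter

# Toy lexicon for illustration only: (language, word) -> shared category.
# Entries and category names are hypothetical, not real lexicon content.
TOY_LEXICON = {
    ("en", "pig"): "animal",
    ("es", "cerdo"): "animal",
    ("it", "maiale"): "animal",
    ("en", "idiot"): "cognitive",
    ("es", "idiota"): "cognitive",
    ("it", "idiota"): "cognitive",
}
CATEGORIES = ["animal", "cognitive"]


def lexicon_features(text, lang):
    """Count lexicon hits per category; the vector layout is language-independent."""
    tokens = text.lower().split()  # naive whitespace tokenization, no punctuation handling
    counts = Counter(
        TOY_LEXICON[(lang, tok)] for tok in tokens if (lang, tok) in TOY_LEXICON
    )
    return [counts[c] for c in CATEGORIES]


if __name__ == "__main__":
    print(lexicon_features("you pig", "en"))
    print(lexicon_features("eres un cerdo idiota", "es"))
```

Because the vector has the same layout for every language, such features could be concatenated with a multilingual text representation before classification, which is one plausible reading of how lexicon knowledge supports zero-shot transfer.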

Articles published on Automatic Hate Speech Detection

Roman urdu hate speech detection using hybrid machine learning models and hyperparameter optimization.

Automatic hate speech detection in audio using machine learning algorithms

Automatic Hate Speech Detection and the hassle of Offensive Language

Hate Speech Detection in Arabic Text: Survey

T5 for Hate Speech, Augmented Data, and Ensemble

An approach to automatic classification of hate speech in sports domain on social media

Accelerating automatic hate speech detection using parallelized ensemble learning models

Exploring Automatic Hate Speech Detection on Social Media: A Focus on Content-Based Analysis

Automated offensive language classification through “emotional” comments from network users

Assessing the Impact of Contextual Information in Hate Speech Detection

Automatic hate speech detection using aspect based feature extraction and Bi-LSTM model

Quantifying the impact of context on the quality of manual hate speech annotation

Improving Zero-Shot Cross-Lingual Hate Speech Detection with Pseudo-Label Fine-Tuning of Transformer Language Models

Towards a Benchmarking System for Comparing Automatic Hate Speech Detection with an Intelligent Baseline Proposal

Hate Speech Detection Using Text Mining and Machine Learning

Large Comparative Study of Recent Computational Approach in Automatic Hate Speech Detection

Detecting Hate Speech on Twitter Network using Ensemble Machine Learning

Combating hate speech using an adaptive ensemble learning model with a case study on COVID-19

Emotionally Informed Hate Speech Detection: A Multi-target Perspective

A joint learning approach with knowledge injection for zero-shot cross-lingual hate speech detection
