Offensive Language Detection Research Articles

Social media serves as a platform for netizens to stay informed and express their opinions through the Internet. Currently, the social media discourse environment faces a significant security threat—offensive comments. A group of users posts comments that are provocative, discriminatory, and objectionable, intending to disrupt online discussions, provoke others, and incite intergroup conflict. These comments undermine citizens’ legitimate rights, disrupt social order, and may even lead to real-world violent incidents. However, current automatic detection of offensive language primarily focuses on a few high-resource languages, leaving low-resource languages, such as Malay, with insufficient annotated corpora for effective detection. To address this, we propose a zero-shot, cross-language unsupervised offensive language detection (OLD) method using a dual-branch mBERT transfer approach. Firstly, using the multi-language BERT (mBERT) model as the foundational language model, the first network branch automatically extracts features from both source and target domain data. Subsequently, Sinkhorn distance is employed to measure the discrepancy between the source and target language feature representations. By estimating the Sinkhorn distance between the labeled source language (e.g., English) and the unlabeled target language (e.g., Malay) feature representations, the method minimizes the Sinkhorn distance adversarially to provide more stable gradients, thereby extracting effective domain-shared features. Finally, offensive pivot words from the source and target language training sets are identified. These pivot words are then removed from the training data in a second network branch, which employs the same architecture. This process constructs an auxiliary OLD task. By concealing offensive pivot words in the training data, the model reduces overfitting and enhances robustness to the target language. In the end-to-end framework training, the combination of cross-lingual shared features and independent features culminates in unsupervised detection of offensive speech in the target language. The experimental results demonstrate that employing cross-language model transfer learning can achieve unsupervised detection of offensive content in low-resource languages. The number of labeled samples in the source language is positively correlated with transfer performance, and a greater similarity between the source and target languages leads to better transfer effects. The proposed method achieves the best performance in OLD on the Malay dataset, achieving an F1 score of 80.7%. It accurately identifies features of offensive speech, such as sarcasm, mockery, and implicit expressions, and showcases strong generalization and excellent stability across different target languages.

Read full abstract

THIS ARTICLE USES WORDS OR LANGUAGE THAT IS CONSIDERED PROFANE, VULGAR, OR OFFENSIVE BY SOME READERS. Different types of abusive content such as offensive language, hate speech, aggression, etc. have become prevalent in social media and many efforts have been dedicated to automatically detect this phenomenon in different resource-rich languages such as English. This is mainly due to the comparative lack of annotated data related to offensive language in low-resource languages, especially the ones spoken in Asian countries. To reduce the vulnerability among social media users from these regions, it is crucial to address the problem of offensive language in such low-resource languages. Hence, we present a new corpus of Persian offensive language consisting of 6,000 out of 520,000 randomly sampled micro-blog posts from X (Twitter) to deal with offensive language detection in Persian as a low-resource language in this area. We introduce a method for creating the corpus and annotating it according to the annotation practices of recent efforts for some benchmark datasets in other languages which results in categorizing offensive language and the target of offense as well. We perform extensive experiments with three classifiers in different levels of annotation with a number of classical Machine Learning (ML), Deep learning (DL), and transformer-based neural networks including monolingual and multilingual pre-trained language models. Furthermore, we propose an ensemble model integrating the aforementioned models to boost the performance of our offensive language detection task. Initial results on single models indicate that SVM trained on character or word n-grams are the best performing models accompanying monolingual transformer-based pre-trained language model ParsBERT in identifying offensive vs non-offensive content, targeted vs untargeted offense, and offensive towards individual or group. In addition, the stacking ensemble model outperforms the single models by a substantial margin, obtaining 5% respective macro F1-score improvement for three levels of annotation.

Read full abstract

Offensive Language Detection Research Articles

Articles published on Offensive Language Detection

Advancing offensive language detection in Arabic social media: a BERT-based ensemble learning approach

DÉDUCTION EFFICACE DE LANGUE OFFENSIVE UTILISANT L'APPRENTISSAGE PROFONDE DANS LES MÉDIAS SOCIAUX

OLF-ML: An Offensive Language Framework for Detection, Categorization, and Offense Target Identification Using Text Processing and Machine Learning Algorithms

Detecting Offensive Language on Malay Social Media: A Zero-Shot, Cross-Language Transfer Approach Using Dual-Branch mBERT

Towards Optimal NLP Solutions: Analyzing GPT and LLaMA-2 Models Across Model Scale, Dataset Size, and Task Diversity

A comprehensive review on Arabic offensive language and hate speech detection on social media: methods, challenges and solutions

Advancing NLP models with strategic text augmentation: A comprehensive study of augmentation methods and curriculum strategies

Enhancing Arabic offensive language detection with BERT-BiGRU model

A survey on multi-lingual offensive language detection.

An inter-modal attention-based deep learning framework using unified modality for multimodal fake news, hate speech and offensive language detection

Filtering offensive language from multilingual social media contents: A deep learning approach

Detecting Offensive Language Based on Graph Attention Networks and Fusion Features

Offensive language and hate speech detection using deep learning in football news live streaming chat on YouTube in Thailand

Offensive language detection in low resource languages: A use case of Persian language.

Deep learning-based approaches for abusive content detection and classification for multi-class online user-generated data

Elevating Offensive Language Detection: CNN-GRU and BERT for Enhanced Hate Speech Identification

Offensive Language Detection for Low Resource Language Using Deep Sequence Model

NA

A Novel Knowledge-augmented Model Customization Approach for Arabic Offensive Language Detection

Hebrew offensive language taxonomy and dataset

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Offensive Language Detection Research Articles

Articles published on Offensive Language Detection

Advancing offensive language detection in Arabic social media: a BERT-based ensemble learning approach

DÉDUCTION EFFICACE DE LANGUE OFFENSIVE UTILISANT L'APPRENTISSAGE PROFONDE DANS LES MÉDIAS SOCIAUX

OLF-ML: An Offensive Language Framework for Detection, Categorization, and Offense Target Identification Using Text Processing and Machine Learning Algorithms

Detecting Offensive Language on Malay Social Media: A Zero-Shot, Cross-Language Transfer Approach Using Dual-Branch mBERT

Towards Optimal NLP Solutions: Analyzing GPT and LLaMA-2 Models Across Model Scale, Dataset Size, and Task Diversity

A comprehensive review on Arabic offensive language and hate speech detection on social media: methods, challenges and solutions

Advancing NLP models with strategic text augmentation: A comprehensive study of augmentation methods and curriculum strategies

Enhancing Arabic offensive language detection with BERT-BiGRU model

A survey on multi-lingual offensive language detection.

An inter-modal attention-based deep learning framework using unified modality for multimodal fake news, hate speech and offensive language detection

Filtering offensive language from multilingual social media contents: A deep learning approach

Detecting Offensive Language Based on Graph Attention Networks and Fusion Features

Offensive language and hate speech detection using deep learning in football news live streaming chat on YouTube in Thailand

Offensive language detection in low resource languages: A use case of Persian language.

Deep learning-based approaches for abusive content detection and classification for multi-class online user-generated data

Elevating Offensive Language Detection: CNN-GRU and BERT for Enhanced Hate Speech Identification

Offensive Language Detection for Low Resource Language Using Deep Sequence Model

NA

A Novel Knowledge-augmented Model Customization Approach for Arabic Offensive Language Detection

Hebrew offensive language taxonomy and dataset