Offensive Language In Social Media Research Articles

Offensive language in social media affects the social experience of individuals and groups and harms social harmony and moral values. The problem of offensive language detection has therefore attracted the attention of many researchers in recent years. However, most current research focuses on detecting offensive language in English, and few studies address Chinese. In this paper, we propose an innovative approach to detecting Chinese offensive language. First, unlike previous approaches, we use both the sentence-level and word-level embeddings of RoBERTa, combining them with a bidirectional GRU and a multi-head self-attention mechanism. This feature fusion lets the model consider sentence-level and word-level semantic information at the same time and so capture the semantics of Chinese text more comprehensively. Second, by concatenating the output of the multi-head attention with RoBERTa’s sentence embedding, we achieve an efficient fusion of local and global information and improve the representation ability of the model. In experiments, the proposed model achieved 82.931% accuracy and an 82.842% F1-score on the Chinese offensive language detection task, indicating high performance and broad application potential.
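The fusion step described in this abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes the word-level states (stand-ins for the Bi-GRU output over RoBERTa token embeddings) and the sentence embedding are already available as plain lists, it uses attention without learned projection weights, and it mean-pools the attended tokens before concatenation, which is a choice made here for illustration. The names `multi_head_self_attention` and `fuse` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Scaled dot-product self-attention with Q = K = V = tokens.

    Assumption: no learned projections, to keep the sketch short.
    """
    dim = len(tokens[0])
    attended = []
    for query in tokens:
        scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
                  for key in tokens]
        weights = softmax(scores)
        attended.append([sum(w * value[i] for w, value in zip(weights, tokens))
                         for i in range(dim)])
    return attended


def multi_head_self_attention(tokens, num_heads):
    """Split the feature dimension into heads, attend per head, re-join."""
    dim = len(tokens[0])
    if dim % num_heads != 0:
        raise ValueError("feature size must be divisible by num_heads")
    head_dim = dim // num_heads
    heads = []
    for h in range(num_heads):
        part = slice(h * head_dim, (h + 1) * head_dim)
        heads.append(self_attention([tok[part] for tok in tokens]))
    return [[x for head in heads for x in head[i]] for i in range(len(tokens))]


def fuse(word_states, sentence_embedding, num_heads=2):
    """Concatenate pooled word-level attention output with the sentence vector."""
    attended = multi_head_self_attention(word_states, num_heads)
    count = len(attended)
    pooled = [sum(column) / count for column in zip(*attended)]  # assumed pooling
    return pooled + list(sentence_embedding)


# Toy inputs: 3 tokens with 4 features each, and a 3-dimensional sentence vector.
word_states = [[0.1, 0.2, 0.3, 0.4], [0.5, 0.1, 0.0, 0.2], [0.3, 0.3, 0.1, 0.9]]
sentence_embedding = [1.0, -1.0, 0.5]
fused = fuse(word_states, sentence_embedding)
```

The fused vector has one slot per word-level feature followed by the sentence-level features, so a downstream classifier would see local and global information side by side. In a real model the attention would carry learned weights and the inputs would come from RoBERTa and a Bi-GRU.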

As the popularity of social media grows, computer-mediated anonymity allows users to engage in activities that they would avoid in real life. This leaves users vulnerable to abuse through Internet platforms. Because of the enormous volume of social media data, it is not possible to manually filter out the flood of abusive content in online communities and social networking sites. This work proposes a multi-level classification model that deploys various machine learning and deep learning models to identify offensive content in a tweet. The proposed Auto-Off ID system classifies tweets as offensive or non-offensive; filters out offensive tweets and classifies them as either targeted or non-targeted; and filters out targeted tweets and identifies mentions of the individuals and organizations who have been bullied. The study is supported by text analysis features together with lexicon features from LIWC, POS tags for primary and secondary users, and Twitter Tag Scores (TTS). The system is evaluated with a diverse selection of machine learning and deep learning models. C-LSTM performs best for offensive language identification, with an accuracy of 91.72%, and LDA + Logistic Regression training with SVM reaches an accuracy of 90.87% for offensive tweet classification.
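The three-level cascade described in this abstract can be sketched as follows. This is only an outline of the control flow under stated assumptions: the classifiers are passed in as plain callables, and the `toy_*` functions are hypothetical keyword and regex placeholders standing in for the trained models (such as C-LSTM) that the abstract mentions. The names `TweetResult` and `classify_tweet` are likewise invented for illustration.

```python
import re
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class TweetResult:
    offensive: bool
    targeted: bool = False
    targets: List[str] = field(default_factory=list)


def classify_tweet(
    text: str,
    is_offensive: Callable[[str], bool],
    is_targeted: Callable[[str], bool],
    find_targets: Callable[[str], List[str]],
) -> TweetResult:
    """Run the cascade; each level only sees tweets passed on by the previous one."""
    if not is_offensive(text):            # level 1: offensive vs non-offensive
        return TweetResult(offensive=False)
    if not is_targeted(text):             # level 2: targeted vs non-targeted
        return TweetResult(offensive=True, targeted=False)
    return TweetResult(True, True, find_targets(text))  # level 3: who is targeted


# Placeholder rules (assumptions, not the paper's features or models).
TOY_LEXICON = {"idiot", "stupid"}


def toy_is_offensive(text: str) -> bool:
    return any(word in TOY_LEXICON for word in re.findall(r"[a-z']+", text.lower()))


def toy_is_targeted(text: str) -> bool:
    return "@" in text


def toy_find_targets(text: str) -> List[str]:
    return re.findall(r"@\w+", text)


def run(text: str) -> TweetResult:
    return classify_tweet(text, toy_is_offensive, toy_is_targeted, toy_find_targets)


targeted = run("@acme_corp you are stupid")
untargeted = run("this is stupid")
clean = run("nice weather today")
```

Passing the three classifiers as arguments keeps the cascade independent of any particular model, so each level can be swapped or evaluated on its own.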

Articles published on Offensive Language In Social Media

Detection of Arabic offensive language in social media using machine learning models

General Review of The Use of Offensive Language in Social Media

RB_BG_MHA: A RoBERTa-Based Model with Bi-GRU and Multi-Head Attention for Chinese Offensive Language Detection in Social Media

Offensive-Language Detection on Multi-Semantic Fusion Based on Data Augmentation

Could a Conversational AI Identify Offensive Language?

Auto-Off ID: Automatic Detection of Offensive Language in Social Media

Identifying and Detecting Offensive Language in Social Media
