FALCoN: Detecting and classifying abusive language in social networks using context features and unlabeled data

Suppawong Tuarob,Manisa Satravisut,Pochara Sangtunchai,Sakunrat Nunthavanich,Thanapon Noraset

doi:10.1016/j.ipm.2023.103381

Suppawong Tuarob, Manisa Satravisut + Show 3 more

https://doi.org/10.1016/j.ipm.2023.103381

Copy DOI

Abstract

Social networks have grown into a widespread form of communication that allows a large number of users to participate in conversations and consume information at any time. The casual nature of social media allows for nonstandard terminology, some of which may be considered rude and derogatory. As a result, a significant portion of social media users is found to express disrespectful language. This problem may intensify in certain developing countries where young children are granted unsupervised access to social media platforms. Furthermore, the sheer amount of social media data generated daily by millions of users makes it impractical for humans to monitor and regulate inappropriate content. If adolescents are exposed to these harmful language patterns without adequate supervision, they may feel obliged to adopt them. In addition, unrestricted aggression in online forums may result in cyberbullying and other dreadful occurrences. While computational linguistics research has addressed the difficulty of detecting abusive dialogues, issues remain unanswered for low-resource languages with little annotated data, leading the majority of supervised techniques to perform poorly. In addition, social media content is often presented in complex, context-rich formats that encourage creative user involvement. Therefore, we propose to improve the performance of abusive language detection and classification in a low-resource setting, using both the abundant unlabeled data and the context features via the co-training protocol that enables two machine learning models, each learning from an orthogonal set of features, to teach each other, resulting in an overall performance improvement. Empirical results reveal that our proposed framework achieves F1 values of 0.922 and 0.827, surpassing the state-of-the-art baselines by 3.32% and 45.85% for binary and fine-grained classification tasks, respectively. In addition to proving the efficacy of co-training in a low-resource situation for abusive language detection and classification tasks, the findings shed light on several opportunities to use unlabeled data and contextual characteristics of social networks in a variety of social computing applications.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

FALCoN: Detecting and classifying abusive language in social networks using context features and unlabeled data

Abstract

Talk to us

Similar Papers

More From: Information Processing & Management

Lead the way for us

Journal: Information Processing & Management	Publication Date: Apr 27, 2023
Citations: 7

Similar Papers

Session details: SONAMA - social network and media analysis track
-
-
--
03 Apr 2017
03 Apr 2017

Measuring and mitigating language model biases in abusive language detection
Rui Song ... Hao Xu
Information Processing & Management | VOL. 60
Rui Song, et. al.Rui Song ... Hao Xu
07 Feb 2023
Information Processing & Management | VOL. 60

Improving Abusive Language Detection with online interaction network
Rui Song ... Hao Xu
Information Processing & Management | VOL. 59
Rui Song, et. al.Rui Song ... Hao Xu
08 Jul 2022
Information Processing & Management | VOL. 59

Abusive Language Detection: A Comprehensive Review
Usman Naseem ... Farasat Ali
Indian Journal of Science and Technology | VOL. 12
Usman Naseem, et. al.Usman Naseem ... Farasat Ali
10 Dec 2019
Indian Journal of Science and Technology | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

FALCoN: Detecting and classifying abusive language in social networks using context features and unlabeled data

Abstract

Talk to us

Similar Papers

More From: Information Processing &amp; Management

More From: Information Processing & Management