A Novel Knowledge-augmented Model Customization Approach for Arabic Offensive Language Detection

Fatemah Husain

doi:10.1145/3634702

Abstract

Multiple attempts to develop systems for detecting online Arabic offensive language have been explored in previous studies. However, most of these attempts do not consider the variation of Arabic dialects, cultures, and offensive phrases. In contrast, this study aims to extract knowledge from multiple offensive language datasets to build a cross-dialect and culture knowledge-based repository. This knowledge-based repository is utilized to develop novel system architecture based on customizing the AraBERT model in a unique method to preserve dialectal knowledge and offensive cultural knowledge within the contextual word embedding of BERT architecture. Performance evaluation procedures consist of statistical evaluation metrics and a behavioral checklist. Results report more effective predictions by the customized model than the uncustomized one, particularly for offensive text. The customization process allows the model to gain more knowledge of informal text in general, and a better understanding of dialectal Arabic.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Novel Knowledge-augmented Model Customization Approach for Arabic Offensive Language Detection

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Asian and Low-Resource Language Information Processing

Lead the way for us

Journal: ACM Transactions on Asian and Low-Resource Language Information Processing	Publication Date: Dec 19, 2023
License type: cc-by-nc

Similar Papers

A Survey of Offensive Language Detection for the Arabic Language
Fatemah Husain ... Ozlem Uzuner
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 20
Fatemah Husain, et. al.Fatemah Husain ... Ozlem Uzuner
31 Jan 2021
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 20

AHMED B. İBRÂHÎM ES-SERÛCÎ VE TUHFETU’L-ASHÂB VE NUZHETU ZEVÎL-ELBÂB ADLI ESERİ
Sedef Güler
Nüsha Şarkiyat Araştırmaları Dergisi | VOL. 20
Sedef GülerSedef Güler
30 Jun 2020
Nüsha Şarkiyat Araştırmaları Dergisi | VOL. 20

Characterization and mechanical properties of offensive language taxonomy and detection techniques
S.V Kogilavani ... S Malliga
Materials Today: Proceedings | VOL. 81
S.V Kogilavani, et. al.S.V Kogilavani ... S Malliga
20 May 2021
Materials Today: Proceedings | VOL. 81

Deep learning-based approaches for abusive content detection and classification for multi-class online user-generated data
Simrat Kaur ... Sakshi Kaushal
International Journal of Cognitive Computing in Engineering | VOL. 5
Simrat Kaur, et. al.Simrat Kaur ... Sakshi Kaushal
01 Jan 2024
International Journal of Cognitive Computing in Engineering | VOL. 5

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Novel Knowledge-augmented Model Customization Approach for Arabic Offensive Language Detection

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Asian and Low-Resource Language Information Processing