Abstract
Hate speech and Offensive Posts (OP) detection on Smart Multimedia Internet of Things (MIoT) have been an active issue for researchers. MIoT media texts in non-native English-speaking countries are often code-mixed or script mixed/switched. This paper proposes an ensemble-based Deep Learning (DL) framework comprised of a Convolutional Neural Network (CNN) and a Dense Neural Network (DNN) for identifying hate and OP in Malayalam Code-Mixed (MCM), Tamil Code-Mixed (TCM), and Malayalam Script-Mixed (MSM) MIoT media postings. Word-level and character-level features are utilized in the convolutional neural network. In contrast, the dense neural network uses character-level Term Frequency-Inverse Document Frequency (TF-IDF) features. The inclusion of character-level features in the proposed ensemble framework resulted in state-of-the-art performance for TCM and MCM datasets, with weighted F 1 -score of 0.91 and 0.78, respectively, and comparable performance for MSM posts, with a weighted F 1 -score of 0.95.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: ACM Transactions on Asian and Low-Resource Language Information Processing
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.