Abstract
Hateful memes spread malicious and biased sentiment widely across the internet. Detecting them differs from traditional multimodal tasks, in which visual and textual information are semantically aligned: in memes, the image and text may be only weakly related or unrelated, so models must understand the content and perform multimodal reasoning. To address this issue, we introduce a multimodal fine-grained hateful memes detection model named “TCAM”. The model leverages advanced encoders from TweetEval and CLIP and introduces enhanced Cross-Attention and Cross-Mask Mechanisms (CAM) in the feature fusion stage to strengthen multimodal correlations. Through transfer learning, it effectively embeds fine-grained features of the data and of image descriptions into the model. This paper uses the Area Under the Receiver Operating Characteristic Curve (AUROC) as the primary metric to evaluate the model’s discriminatory ability. The approach achieves an AUROC of 0.8362 and an accuracy of 0.764 on the Facebook Hateful Memes Challenge (FHMC) dataset, confirming its high discriminatory capability. The TCAM model performs favorably compared to ensemble machine learning methods.
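The abstract describes fusing TweetEval-style text features with CLIP image features via cross-attention. The sketch below is a minimal illustration of that idea only; the layer sizes, pooling, masking scheme, and classifier head are assumptions for demonstration and are not the paper’s actual TCAM/CAM architecture.

```python
import torch
import torch.nn as nn


class CrossAttentionFusion(nn.Module):
    """Illustrative cross-attention fusion of text and image embeddings.

    Hypothetical dimensions: 768-d text token states (e.g. a TweetEval/RoBERTa
    encoder) and 512-d image patch embeddings (e.g. a CLIP vision encoder).
    """

    def __init__(self, text_dim=768, image_dim=512, hidden_dim=512, num_heads=8):
        super().__init__()
        # Project both modalities into a shared space before attention.
        self.text_proj = nn.Linear(text_dim, hidden_dim)
        self.image_proj = nn.Linear(image_dim, hidden_dim)
        # Text tokens attend over image patches (queries = text, keys/values = image).
        self.cross_attn = nn.MultiheadAttention(hidden_dim, num_heads, batch_first=True)
        self.classifier = nn.Linear(hidden_dim, 2)  # hateful vs. non-hateful

    def forward(self, text_tokens, image_patches, image_pad_mask=None):
        # text_tokens: (B, T, text_dim); image_patches: (B, P, image_dim)
        q = self.text_proj(text_tokens)
        kv = self.image_proj(image_patches)
        fused, _ = self.cross_attn(q, kv, kv, key_padding_mask=image_pad_mask)
        # Mean-pool the fused text positions and classify.
        return self.classifier(fused.mean(dim=1))


# Usage with dummy tensors: batch of 4, 32 text tokens, 49 image patches.
model = CrossAttentionFusion()
logits = model(torch.randn(4, 32, 768), torch.randn(4, 49, 512))
print(logits.shape)  # torch.Size([4, 2])
```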