Fuser: An enhanced multimodal fusion framework with congruent reinforced perceptron for hateful memes detection

Fan Wu, Zhengjun Liu, Xiaoou Pan, Linlin Li, Bin Gao, Yujiao Ma, Shutian Liu

doi:10.1016/j.ipm.2024.103772

Abstract

As a multimodal form of hate speech on social media, hateful memes are a more aggressive and cryptic threat to people's real lives. Automatic detection of hateful memes is crucial, but the images and texts in most memes are only weakly consistent or even unrelated. Although existing works have achieved the initial goal of detecting hateful memes with pre-trained models, they are limited to monolithic inference methods and ignore the semantic differences between multimodal representations. To strengthen comprehension of, and reasoning about, the hidden meaning behind memes by drawing on real-world knowledge, we propose Fuser, an enhanced multimodal fusion framework with a congruent reinforced perceptron for hateful memes detection. Inspired by the human cognitive mechanism, we first divide the extracted multisource representations into main semantics and auxiliary contexts based on their strength and relevance, and then pre-encode each group into lightly correlated embeddings with unified spatial dimensions via a novel prefix uniform layer. To jointly learn the intrinsic correlation between primary and secondary semantics, we design a congruent reinforced perceptron with brain-like perceptual integration that fuses multimodal representations in a shared latent space while maintaining feature integrity in the sub-fusion space, thereby implicitly reasoning about the subtle metaphors behind the memes. Extensive experiments on four benchmark datasets demonstrate the effectiveness of our architecture compared with previous state-of-the-art methods.
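
The two-stage idea in the abstract (bring differently sized features to a unified dimension, then fuse them in a shared space without overwriting the main features) can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify the prefix uniform layer or the congruent reinforced perceptron, so the code below swaps in a plain linear projection and a generic gated residual fusion. All names (`linear`, `make_weights`, `fuse`), the feature sizes, the random weights, and the choice of text as the "main" input are assumptions made purely for illustration.

```python
import math
import random

def linear(x, weights):
    """Multiply vector x (length n) by a weight matrix given as m rows of length n."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]

def make_weights(rows, cols, rng):
    """Hypothetical random projection; a trained model would learn these values."""
    scale = 1.0 / math.sqrt(cols)
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fuse(main, aux, w_main, w_aux):
    """Project both inputs to a shared dimension, then gate the auxiliary signal.

    Assumed stand-in for the paper's fusion module: the residual form keeps
    the projected main features and adds auxiliary context scaled by a gate.
    """
    m = linear(main, w_main)  # main semantics in the shared space
    a = linear(aux, w_aux)    # auxiliary context in the shared space
    gate = [sigmoid(mi * ai) for mi, ai in zip(m, a)]  # elementwise agreement score
    return [mi + g * ai for mi, g, ai in zip(m, gate, a)]

rng = random.Random(0)
shared_dim = 4
text_feat = [rng.gauss(0, 1) for _ in range(6)]    # stand-in for a text embedding
image_feat = [rng.gauss(0, 1) for _ in range(10)]  # stand-in for an image embedding
w_text = make_weights(shared_dim, 6, rng)
w_image = make_weights(shared_dim, 10, rng)

fused = fuse(text_feat, image_feat, w_text, w_image)
print(len(fused))
```

In this sketch, an all-zero auxiliary input leaves the projected main features unchanged, which is one simple way to read "maintaining feature integrity" while still letting auxiliary context contribute when it is present.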

Journal: Information Processing & Management