Synergistic Detection of Multimodal Fake News Leveraging TextGCN and Vision Transformer

Visweswaran M,Jayanth Mohan,S Sachin Kumar,K P Soman

doi:10.1016/j.procs.2024.04.017

Abstract

In today's digital age, the rapid spread of fake news is a pressing concern. Fake news, whether intentional or inadvertent, manipulates public sentiment and threatens the integrity of online information. To address this, effective detection and prevention methods are vital. Detecting and addressing this multimodal fake news is an intricate challenge as, unlike traditional news articles that predominantly rely on textual content, multimodal fake news leverages the persuasive power of visual elements, making its identification a formidable task. Manipulated images can significantly sway individuals' perceptions and beliefs, making the detection of such deceptive content complex. Our research introduces an innovative approach to multimodal fake news identification by presenting a fusion-based methodology that harnesses the capabilities of Text Graph Convolutional Neural Networks (TextGCN) and Vision Transformers (ViT) to effectively utilise both text and image modalities. The proposed Methodology starts with preprocessing textual content using TextGCN, allowing for the capture of intricate structural dependencies among words and phrases. Simultaneously, visual features are extracted from associated images using ViT. Through a fusion mechanism, these modalities seamlessly integrate, yielding superior embeddings. The primary contributions encompass an in-depth exploration of multimodal fake news detection through a fusion-based approach. What sets our approach apart from existing techniques is its integration of graph-based feature extraction through TextGCN. While previous methods predominantly rely on text or image features, our approach harnesses the additional semantic information and intricate relationships within a graph structure, in addition to image embeddings. This enables our method to capture more comprehensive understanding of the data, resulting in increased accuracy and reliability. Our experiments demonstrate the exceptional performance of our fusion-based approach, which leverages multiple modalities and incorporates graph-based representations and semantic relationships. This method outperformed single modalities of text or image, achieving an impressive accuracy of 94.17% using a neural network after fusion. By seamlessly integrating graph-based representations and semantic relationships, our fusion-based technique represents a significant stride in addressing the challenges posed by multimodal fake news.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Synergistic Detection of Multimodal Fake News Leveraging TextGCN and Vision Transformer

Abstract

Talk to us

Similar Papers

More From: Procedia Computer Science

Lead the way for us

Journal: Procedia Computer Science	Publication Date: Jan 1, 2024
License type: cc-by

Similar Papers

Leveraging Heterogeneous Data for Fake News Detection
K Anoop ... V L Lajish
-
K Anoop, et. al.K Anoop ... V L Lajish
27 Nov 2018
27 Nov 2018

Detection of Fake News Using Transformer Model
Momina Qazi ... Mazhar Ali
-
Momina Qazi, et. al.Momina Qazi ... Mazhar Ali
01 Jan 2020
01 Jan 2020

Detecting fake news stories via multimodal analysis
Vivek K Singh ... Darshan Sonagara
Journal of the Association for Information Science and Technology | VOL. 72
Vivek K Singh, et. al.Vivek K Singh ... Darshan Sonagara
04 May 2020
Journal of the Association for Information Science and Technology | VOL. 72

GBCA: Graph Convolution Network and BERT combined with Co-Attention for fake news detection
Zhen Zhang ... Guohua Wu
Pattern Recognition Letters | VOL. 180
Zhen Zhang, et. al.Zhen Zhang ... Guohua Wu
23 Feb 2024
Pattern Recognition Letters | VOL. 180

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Synergistic Detection of Multimodal Fake News Leveraging TextGCN and Vision Transformer

Abstract

Talk to us

Similar Papers

More From: Procedia Computer Science