FMFN: Fine-Grained Multimodal Fusion Networks for Fake News Detection

Jingzi Wang,Hongyan Mao,Hongwei Li

doi:10.3390/app12031093

Jingzi Wang, Hongyan Mao + Show 1 more

Open Access

https://doi.org/10.3390/app12031093

Copy DOI

Abstract

As one of the most popular social media platforms, microblogs are ideal places for news propagation. In microblogs, tweets with both text and images are more likely to attract attention than text-only tweets. This advantage is exploited by fake news producers to publish fake news, which has a devasting impact on individuals and society. Thus, multimodal fake news detection has attracted the attention of many researchers. For news with text and image, multimodal fake news detection utilizes both text and image information to determine the authenticity of news. Most of the existing methods for multimodal fake news detection obtain a joint representation by simply concatenating a vector representation of the text and a visual representation of the image, which ignores the dependencies between them. Although there are a small number of approaches that use the attention mechanism to fuse them, they are not fine-grained enough in feature fusion. The reason is that, for a given image, there are multiple visual features and certain correlations between these features. They do not use multiple feature vectors representing different visual features to fuse with textual features, and ignore the correlations, resulting in inadequate fusion of textual features and visual features. In this paper, we propose a novel fine-grained multimodal fusion network (FMFN) to fully fuse textual features and visual features for fake news detection. Scaled dot-product attention is utilized to fuse word embeddings of words in the text and multiple feature vectors representing different features of the image, which not only considers the correlations between different visual features but also better captures the dependencies between textual features and visual features. We conduct extensive experiments on a public Weibo dataset. Our approach achieves competitive results compared with other methods for fusing visual representation and text representation, which demonstrates that the joint representation learned by the FMFN (which fuses multiple visual features and multiple textual features) is better than the joint representation obtained by fusing a visual representation and a text representation in determining fake news.

Highlights

With the rapid development of social networks, social media platforms have become ideal places for news propagation [1]
We use deep convolutional neural networks (CNNs) to extract multiple visual features of a given image and RoBERTa [8] to obtain deep contextualized word embeddings of words, each of which can be considered as a textual feature
The feature fusion in our method is entirely based on the scaled dot-product attention, and the proposed method is expected to improve the performance of fake news detection

Summary

Introduction

With the rapid development of social networks, social media platforms have become ideal places for news propagation [1]. As one of the most popular social media platforms, microblogs, such as Twitter and Weibo, allow people to share and forward tweets, where the tweets with both text and images are more likely to attract attention than the text-only tweets This advantage is exploited by fake news producers, who post tweets about fake news on microblogs by manipulating text and forging images. To overcome the limitations of the aforementioned methods, the fine-grained multimodal fusion networks (FMFN) is proposed for fake news detection. To effectively detect fake news with text and image, we propose a novel model for fine-grained fusion of textual features and visual features. We review related work on fake news detection and scaled dot-product attention.

Related Work

Unimodal Fake News Detection

Multimodal Fake News Detection

Scaled-Dot Product Attention

Visual Feature Extraction

Textual Feature Extraction

Feature Fusion

Fake News Detector and Model Learning

Settings

Baselines

Method

Comparison with Baselines

Visualization of the Joint Representation

Findings

Conclusions and Future Work

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied sciences	Publication Date: Jan 21, 2022
Citations: 25	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

FMFN: Fine-Grained Multimodal Fusion Networks for Fake News Detection

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied sciences

Lead the way for us

Similar Papers

Using ALBERT and Multi-modal Circulant Fusion for Fake News Detection
Xingang Wang ... Xiaoyu Liu
-
Xingang Wang, et. al.Xingang Wang ... Xiaoyu Liu
09 Oct 2022
09 Oct 2022

SceneFND: Multimodal fake news detection by modelling scene context information
Guobiao Zhang ... Anastasia Giachanou
Journal of Information Science | VOL. 50
Guobiao Zhang, et. al.Guobiao Zhang ... Anastasia Giachanou
23 Apr 2022
Journal of Information Science | VOL. 50

Detecting fake news stories via multimodal analysis
Vivek K Singh ... Isha Ghosh
Journal of the American Society for Information Science and Technology | VOL. 72
Vivek K Singh, et. al.Vivek K Singh ... Isha Ghosh
04 May 2020
Journal of the American Society for Information Science and Technology | VOL. 72

AMFB: Attention based multimodal Factorized Bilinear Pooling for multimodal Fake News Detection
Rina Kumari ... Asif Ekbal
Expert systems with applications | VOL. 184
Rina Kumari, et. al.Rina Kumari ... Asif Ekbal
29 Jun 2021
Expert systems with applications | VOL. 184

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

FMFN: Fine-Grained Multimodal Fusion Networks for Fake News Detection

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied sciences