Abstract

Multi-modal Aspect-based Sentiment Analysis (MABSA) aims to predict the sentiment polarity towards aspects in a given sentence based on the correlation between the sentence and its accompanying image. Understanding multi-modal sentiment expression requires strong cross-modal alignment and fusion abilities. Previous state-of-the-art (SOTA) models fail to explicitly align valuable visual clues with the aspect and sentiment information in textual representations, and they overlook the syntactic dependency information available in the accompanying text modality. We present CoolNet (Cross-modal Fine-grained Alignment and Fusion Network) to boost the ability of vision-language models to seamlessly integrate visual and linguistic information. Specifically, CoolNet first transforms an image into a textual caption and a graph structure, then dynamically aligns the semantic and syntactic information from both the input sentence and the generated caption while modeling object-level visual features. Finally, a cross-modal transformer fuses and models the inter-modality dynamics, giving the network fine-grained cross-modal alignment and fusion capabilities. On the standard Twitter-2015 and Twitter-2017 benchmarks, CoolNet consistently outperforms the SOTA model FITE, improving accuracy and Macro-F1 by 1.43% and 1.38% on Twitter-2015 and by 0.74% and 0.88% on Twitter-2017, respectively, demonstrating the superiority of our architecture.
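
To make the pipeline concrete, the sketch below shows one way the stages described in the abstract could fit together: encoding the sentence and the generated caption, injecting syntactic dependency structure, aligning sentence and caption tokens, and fusing them with object-level visual features through a cross-modal transformer. This is a minimal illustration under our own assumptions, not the authors' implementation; all module names, feature dimensions, and the use of a generic transformer encoder for fusion are hypothetical, since the abstract does not specify them.

# Minimal, hypothetical sketch of a CoolNet-style pipeline (not the
# authors' code). Dimensions and module choices are illustrative only.
import torch
import torch.nn as nn

class CoolNetSketch(nn.Module):
    def __init__(self, d_model=256, n_heads=8, n_layers=2,
                 vocab=30522, n_classes=3):
        super().__init__()
        # Shared embedding for sentence and caption tokens.
        self.embed = nn.Embedding(vocab, d_model)
        # Approximates syntactic modeling: mixes each token with its
        # dependency-graph neighbors (a stand-in for a graph encoder).
        self.syntax_proj = nn.Linear(d_model, d_model)
        # Projects object-level visual features (e.g. 2048-d detector
        # outputs, an assumed size) into the shared space.
        self.visual_proj = nn.Linear(2048, d_model)
        # Cross-modal transformer that fuses all modality tokens.
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.fusion = nn.TransformerEncoder(layer, n_layers)
        self.classifier = nn.Linear(d_model, n_classes)

    def forward(self, text_ids, caption_ids, dep_adj, obj_feats):
        # 1) Encode the input sentence and the generated image caption.
        text = self.embed(text_ids)        # (B, Lt, D)
        caption = self.embed(caption_ids)  # (B, Lc, D)
        # 2) Inject syntactic dependency structure into sentence tokens.
        text = text + self.syntax_proj(dep_adj @ text)
        # 3) Semantic alignment of sentence and caption tokens via
        #    scaled dot-product cross-attention.
        attn = torch.softmax(
            text @ caption.transpose(1, 2) / text.size(-1) ** 0.5, dim=-1)
        aligned = text + attn @ caption
        # 4) Project visual features and fuse all modalities jointly.
        vision = self.visual_proj(obj_feats)  # (B, Lv, D)
        fused = self.fusion(torch.cat([aligned, caption, vision], dim=1))
        # 5) Pool and predict the sentiment polarity.
        return self.classifier(fused.mean(dim=1))

# Example with dummy inputs:
model = CoolNetSketch()
logits = model(
    torch.randint(0, 30522, (2, 12)),  # sentence token ids
    torch.randint(0, 30522, (2, 10)),  # generated caption token ids
    torch.rand(2, 12, 12),             # soft dependency adjacency (demo)
    torch.rand(2, 6, 2048),            # 6 detected objects per image
)
print(logits.shape)  # torch.Size([2, 3]): negative / neutral / positive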
