Abstract

Multimodal sentiment analysis has recently demonstrated its significance in a variety of domains. For sentiment analysis, different aspects of distinct modalities that correspond to one target are processed and analyzed. In this work, we propose targeted aspect-based multimodal sentiment analysis (TABMSA) for the first time. Furthermore, we devise an attention capsule extraction and multi-head fusion network (EF-Net) for the task of TABMSA. A multi-head attention (MHA) based network and ResNet-152 are employed to process the text and the image, respectively. The integration of MHA and the capsule network aims to capture the interaction among the multimodal inputs. In addition to the targeted aspect, information from the context and the image is also incorporated into the delivered sentiment. We evaluate the proposed model on two manually annotated datasets. The experimental results demonstrate the effectiveness of the proposed model on this new task.
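
As a rough illustration of the architecture described above, the sketch below wires an MHA-based text encoder, a ResNet-152 image encoder, and an MHA fusion step into a single sentiment classifier. It is a minimal sketch only: the module names, dimensions, and the mean-pool-plus-linear head standing in for the capsule layer are assumptions, not the authors' implementation.

    import torch
    import torch.nn as nn
    from torchvision.models import resnet152

    class EFNetSketch(nn.Module):
        """Minimal sketch of an EF-Net-style TABMSA model (dimensions are assumptions)."""
        def __init__(self, vocab_size=30000, d_model=256, n_heads=8, n_classes=3):
            super().__init__()
            # Text branch: token embeddings refined by multi-head self-attention.
            self.embed = nn.Embedding(vocab_size, d_model)
            self.text_mha = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
            # Image branch: ResNet-152 backbone with its final fc replaced by a projection to d_model.
            backbone = resnet152(weights=None)
            backbone.fc = nn.Linear(backbone.fc.in_features, d_model)
            self.image_enc = backbone
            # Fusion: text tokens (queries) attend to the image feature (key/value).
            self.fusion_mha = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
            # Stand-in for the capsule layer: mean-pool + linear classifier (assumption).
            self.classifier = nn.Linear(d_model, n_classes)

        def forward(self, token_ids, image):
            t = self.embed(token_ids)                  # (B, L, d_model)
            t, _ = self.text_mha(t, t, t)              # self-attention over the text
            v = self.image_enc(image).unsqueeze(1)     # (B, 1, d_model)
            fused, _ = self.fusion_mha(t, v, v)        # text attends to the image
            return self.classifier(fused.mean(dim=1))  # sentiment logits

    model = EFNetSketch()
    logits = model(torch.randint(0, 30000, (2, 16)), torch.randn(2, 3, 224, 224))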

Highlights

  • Sentiment analysis, also referred to as sentiment classification, aims to extract opinions from large amounts of unstructured text and classify them into sentiment polarities: positive, neutral, or negative [1]

  • On current shopping and social platforms, text and image information mutually reinforce and complement each other, so models are dedicatedly devised to classify sentiment polarity by using both kinds of data and their latent relation [5]. Recent publications report achievements on the task of multimodal sentiment analysis

  • Unlike the previous approach of bilinear pooling, we use a multi-head attention network for multimodal feature fusion, because the multi-head attention mechanism can focus on the interaction between the textual and visual modalities in different facets; this helps the model capture more inter-modality correlation information (see the sketch after this list)
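
To make the "different facets" point concrete, the toy snippet below performs one cross-modal multi-head attention step, with text features as queries and image-region features as keys and values, and keeps the per-head attention maps; each head yields its own text-to-region weighting. Tensor shapes and names are illustrative assumptions rather than the paper's configuration.

    import torch
    import torch.nn as nn

    d_model, n_heads = 256, 8
    cross_mha = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

    text_feats = torch.randn(1, 16, d_model)   # 16 text tokens (assumed shape)
    image_feats = torch.randn(1, 49, d_model)  # 49 image regions, e.g. a 7x7 grid (assumed)

    # Text queries attend to image keys/values; keep per-head weights to inspect each "facet".
    fused, attn = cross_mha(text_feats, image_feats, image_feats,
                            need_weights=True, average_attn_weights=False)
    print(fused.shape)  # torch.Size([1, 16, 256]) -- image-aware text features
    print(attn.shape)   # torch.Size([1, 8, 16, 49]) -- one text-to-region map per head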

Summary

INTRODUCTION

Sentiment analysis, also referred to as sentiment classification, aims to extract opinions from large amounts of unstructured text and classify them into sentiment polarities: positive, neutral, or negative [1]. On current shopping and social platforms, text and image information mutually reinforce and complement each other, so models are dedicatedly devised to classify sentiment polarity by using both kinds of data and their latent relation [5]. Recent publications report achievements on the task of multimodal sentiment analysis. Unlike the previous approach of bilinear pooling, we use a multi-head attention network for multimodal feature fusion, because the multi-head attention mechanism can focus on the interaction between the textual and visual modalities in different facets; this helps the model capture more inter-modality correlation information. Yu et al. proposed a Multimodal BERT architecture, which adapts BERT for cross-modal interaction to obtain target-sensitive textual/visual representations and utilizes stacked self-attention layers to achieve multimodal fusion [5]. In what follows, X indicates a general input of the MHA network (a standard formulation is sketched below).
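
The sentence above refers to X as a general input of the MHA network, but the accompanying equation is not reproduced on this page; the block below restates the standard multi-head attention formulation from the Transformer literature under that reading, with W_i^Q, W_i^K, W_i^V, and W^O as learned projections and d_k the per-head key dimension (the exact parameterization used in EF-Net is an assumption).

    \mathrm{Attention}(Q, K, V) = \mathrm{softmax}\!\left(\frac{Q K^{\top}}{\sqrt{d_k}}\right) V
    \mathrm{head}_i = \mathrm{Attention}\!\left(X W_i^{Q},\; X W_i^{K},\; X W_i^{V}\right)
    \mathrm{MHA}(X) = \mathrm{Concat}\!\left(\mathrm{head}_1, \ldots, \mathrm{head}_h\right) W^{O}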

FEATURE EXTRACTING LAYER
Method
CASE STUDY
Findings
CONCLUSION