GNN-Based Multimodal Named Entity Recognition

Yunchao Gong, Xindong You, Feng Hu, Yuzhong Chen, Zhu Yuan, Xueqiang Lv

doi:10.1093/comjnl/bxae030

Abstract

Multimodal Named Entity Recognition (MNER) uses visual information from images to enrich text representations and improve the accuracy and robustness of named entity recognition. Previous methods, however, have two limitations. (i) The semantic mismatch between the text and image modalities makes it difficult to establish accurate connections between words and visual representations. The short length of social media posts adds semantic and contextual ambiguity, which further exacerbates this mismatch. (ii) Existing methods rely on cross-modal attention mechanisms for interaction and fusion between modalities, and overlook fine-grained correspondences between the semantic units of text and images. To alleviate these issues, we propose a graph neural network approach for MNER (GNN-MNER) that promotes fine-grained alignment and interaction between semantic units of different modalities. To mitigate the semantic mismatch, we construct corresponding graph structures for text and images and use graph convolutional networks to augment the text and visual representations. To address the second limitation, we propose a multimodal interaction graph that explicitly represents the fine-grained semantic correspondences between text and visual objects; based on this graph, we perform deep feature fusion between modalities using graph attention networks. Compared with existing methods, our approach is the first to apply graph deep learning throughout the MNER task. Extensive experiments on the Twitter multimodal datasets validate the effectiveness of GNN-MNER.
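
The abstract names two building blocks: graph convolution over per-modality graphs, and attention-based fusion over a multimodal interaction graph. The sketch below illustrates both ideas on toy data and is not the authors' implementation. The function names, the toy graphs, and the feature values are all hypothetical. The fusion step uses plain dot-product attention with a residual sum as a simplified stand-in for a graph attention network, which would normally learn its scoring function.

```python
import math

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def gcn_layer(adj, feats, weight):
    """One graph-convolution step: ReLU(D^-1/2 (A + I) D^-1/2 H W)."""
    n = len(adj)
    # Add self-loops so each node keeps part of its own features.
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    deg = [sum(row) for row in a_hat]
    # Symmetric degree normalisation of the adjacency matrix.
    norm = [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)] for i in range(n)]
    out = matmul(matmul(norm, feats), weight)
    return [[max(0.0, v) for v in row] for row in out]

def cross_modal_attention(word_feats, obj_feats, edges):
    """Fuse each word with the visual objects it is linked to.

    edges maps a word index to the object indices it connects to in a
    (hypothetical) multimodal interaction graph. Attention weights are a
    softmax over dot-product scores (an assumption; a real GAT learns them).
    Words without links are returned unchanged.
    """
    fused = []
    for i, w in enumerate(word_feats):
        nbrs = edges.get(i, [])
        if not nbrs:
            fused.append(list(w))
            continue
        scores = [sum(a * b for a, b in zip(w, obj_feats[j])) for j in nbrs]
        top = max(scores)  # subtract the max for numerical stability
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        attn = [e / total for e in exps]
        ctx = [sum(attn[k] * obj_feats[j][d] for k, j in enumerate(nbrs))
               for d in range(len(w))]
        # Residual sum of the word feature and its attended visual context.
        fused.append([a + b for a, b in zip(w, ctx)])
    return fused

# Toy text graph: three words connected in a chain (0-1-2), 2-dim features.
text_adj = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
word_feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]  # identity weights keep the example readable
words = gcn_layer(text_adj, word_feats, identity)

# Two visual objects; word 0 links to object 0, word 2 links to both objects.
obj_feats = [[1.0, 0.0], [0.0, 1.0]]
fused = cross_modal_attention(words, obj_feats, {0: [0], 2: [0, 1]})
```

In this toy setup, word 1 has no linked object, so its representation passes through the fusion step unchanged, while words 0 and 2 receive a weighted mix of the object features they are connected to.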

Published in: The Computer Journal
