Hierarchical Aligned Multimodal Learning for NER on Tweet Posts

Peipei Liu, Limin Sun, Hong Li, Yimo Ren, Shuaizong Si, Hongsong Zhu, Jie Liu

doi:10.1609/aaai.v38i17.29831

Abstract

Mining structured knowledge from tweets with named entity recognition (NER) benefits many downstream applications, such as recommendation and intention understanding. As tweet posts are increasingly multimodal, multimodal named entity recognition (MNER) has attracted growing attention. In this paper, we propose a novel approach that dynamically aligns the image and text sequence and performs multi-level cross-modal learning to augment textual word representations for MNER. Specifically, our framework has three main stages: the first focuses on intra-modality representation learning to derive the implicit global and local knowledge of each modality; the second evaluates the relevance between the text and its accompanying image and integrates visual information of different granularities based on that relevance; the third enforces semantic refinement via iterative cross-modal interactions and co-attention. We conduct experiments on two open datasets, and the results and detailed analysis demonstrate the advantage of our model.
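The second and third stages described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how relevance is computed or how the attention is parameterized, so the cosine-based relevance score, the single unparameterized attention pass, and every function name below (`relevance`, `cross_attend`, `augment`) are assumptions made purely for illustration. The sketch shows only the general idea of scaling visual context by a text-image relevance score before adding it to the word representations.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def softmax(scores):
    # Subtract the max for numerical stability.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def mean_vector(vectors):
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def relevance(text_tokens, image_regions):
    # ASSUMPTION: relevance is the cosine similarity of the mean-pooled
    # text and image vectors, rescaled from [-1, 1] to [0, 1].
    t = mean_vector(text_tokens)
    v = mean_vector(image_regions)
    norm = math.sqrt(dot(t, t)) * math.sqrt(dot(v, v))
    cos = dot(t, v) / norm if norm else 0.0
    return (cos + 1.0) / 2.0


def cross_attend(text_tokens, image_regions):
    # Each token attends over all image regions (scaled dot-product attention
    # without learned projections) and receives a weighted visual context.
    d = len(text_tokens[0])
    contexts = []
    for tok in text_tokens:
        weights = softmax([dot(tok, reg) / math.sqrt(d) for reg in image_regions])
        contexts.append(
            [sum(w * reg[i] for w, reg in zip(weights, image_regions)) for i in range(d)]
        )
    return contexts


def augment(text_tokens, image_regions):
    # Visual context is scaled by the relevance score, so an unrelated
    # image contributes little to the augmented word representations.
    r = relevance(text_tokens, image_regions)
    contexts = cross_attend(text_tokens, image_regions)
    return [
        [t + r * c for t, c in zip(tok, ctx)]
        for tok, ctx in zip(text_tokens, contexts)
    ]
```

Under these assumptions, calling `augment(text_tokens, image_regions)` with two lists of equal-dimension vectors returns one augmented vector per token. When the pooled image vector points away from the pooled text vector, the relevance score approaches 0 and the tokens are left nearly unchanged.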

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence
Publication Date: Mar 24, 2024
Citations: 1

