From context-aware to knowledge-aware: Boosting OOV tokens recognition in slot tagging with background knowledge

Keqing He,Yuanmeng Yan,Weiran Xu

doi:10.1016/j.neucom.2021.01.134

Abstract

Neural-based context-aware models for slot tagging tasks in language understanding have achieved state-of-the-art performance, especially deep contextualized models, such as ELMo, BERT. However, the presence of out-of-vocab (OOV) words significantly degrades the performance of neural-based models, especially in a few-shot scenario. In this paper, we propose a novel knowledge-aware slot tagging model to integrate contextual representation of input text and the large-scale lexical background knowledge. Besides, we use multi-level graph attention to explicitly reason via lexical relations. We aim to leverage both linguistic regularities covered by deep language models (LM) and high-quality background knowledge derived from curated knowledge bases (KB). Consequently, our model could infer rare and unseen words in the test dataset by incorporating contextual semantics learned from the training dataset and lexical relations from ontology. The experiments show that our proposed knowledge integration mechanism achieves consistent improvements across settings with different sizes of training data on two public benchmark datasets. We also show through detailed analysis that incorporating background knowledge effectively alleviates issues of data scarcity.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

From context-aware to knowledge-aware: Boosting OOV tokens recognition in slot tagging with background knowledge

Abstract

Talk to us

Similar Papers

More From: Neurocomputing

Lead the way for us

Journal: Neurocomputing	Publication Date: Mar 2, 2021
Citations: 6

Similar Papers

Learning to Tag OOV Tokens by Integrating Contextual Representation and Background Knowledge
Keqing He ... Weiran Xu
-
Keqing He, et. al.Keqing He ... Weiran Xu
01 Jan 2020
01 Jan 2020

Framework for Deep Learning-Based Language Models Using Multi-Task Learning in Natural Language Understanding: A Systematic Literature Review and Future Directions
Rahul Manohar Samant ... Shilpa Gite
IEEE Access | VOL. 10
Rahul Manohar Samant, et. al.Rahul Manohar Samant ... Shilpa Gite
01 Jan 2021
IEEE Access | VOL. 10

Dimensionality and Ramping: Signatures of Sentence Integration in the Dynamics of Brains and Deep Language Models.
Théo Desbordes ... Maxime Oquab
The Journal of Neuroscience | VOL. 43
Théo Desbordes, et. al.Théo Desbordes ... Maxime Oquab
22 May 2023
The Journal of Neuroscience | VOL. 43

Real-Time Social Media Analytics with Deep Transformer Language Models: A Big Data Approach
Ahmed Ahmet ... Tariq Abdullah
-
Ahmed Ahmet, et. al.Ahmed Ahmet ... Tariq Abdullah
01 Dec 2020
01 Dec 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

From context-aware to knowledge-aware: Boosting OOV tokens recognition in slot tagging with background knowledge

Abstract

Talk to us

Similar Papers

More From: Neurocomputing