Abstract

Bidirectional long short-term memory (Bi-LSTM) is an effective network for sequence labeling tasks and is widely used in named entity recognition (NER). However, because Bi-LSTM processes tokens sequentially and handles only one sentence at a time, it cannot capture global, cross-sentence information. In this paper, to compensate for this shortcoming of Bi-LSTM, we propose a hierarchical context model with embedded sentence-level and document-level feature extraction. For sentence-level feature extraction, we use a self-attention mechanism that weights each word by its contribution to the sentence when building the sentence-level representation. For document-level feature extraction, we use a 3D convolutional neural network (CNN), which not only extracts features within sentences but also captures the sequential relationships between sentences. Furthermore, we investigate a layer-by-layer residual (LBL Residual) structure that optimizes each Bi-LSTM block of our model and alleviates the degradation problem that arises as the number of layers increases. Experiments show that our model achieves results competitive with the state of the art on the CoNLL-2003 and OntoNotes 5.0 English datasets.
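The sentence-level attention pooling described above can be sketched as follows. This is a minimal NumPy illustration, not the paper's implementation: the projection `W`, query vector `v`, and all dimensions are assumptions chosen for the example, and `H` stands in for the per-word Bi-LSTM outputs.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over a 1-D score vector.
    e = np.exp(x - x.max())
    return e / e.sum()

def attentive_sentence_repr(H, W, v):
    """Pool word vectors into one sentence vector via self-attention.

    H : (n_words, d)  per-word representations (e.g. Bi-LSTM outputs)
    W : (d_a, d)      attention projection matrix (illustrative)
    v : (d_a,)        attention query vector (illustrative)
    Returns a (d,) sentence-level representation: each word is weighted
    by its (normalized) attention score, so words contribute unequally.
    """
    scores = v @ np.tanh(W @ H.T)   # (n_words,) unnormalized scores
    alpha = softmax(scores)         # each word's contribution, sums to 1
    return alpha @ H                # attention-weighted sum of word vectors

rng = np.random.default_rng(0)
H = rng.standard_normal((5, 8))     # a 5-word sentence, dim-8 word vectors
W = rng.standard_normal((4, 8))
v = rng.standard_normal(4)
s = attentive_sentence_repr(H, W, v)
print(s.shape)  # (8,)
```

The same idea extends to the document level: stacking the resulting sentence vectors gives the input over which the 3D CNN can convolve across neighboring sentences.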
