ASRNN: A recurrent neural network with an attention model for sequence labeling

Jerry Chun-Wei Lin, Yinan Shao, Youcef Djenouri, Unil Yun

doi:10.1016/j.knosys.2020.106548

Abstract

Natural language processing (NLP) is useful for handling text and speech, and sequence labeling plays an important role in it by automatically analyzing a sequence (text) and assigning a category label to each part. However, the performance of conventional sequence labeling models depends greatly on hand-crafted features and task-specific knowledge, which are time-consuming to develop. Several conditional random field (CRF)-based models for sequence labeling have been presented, but their major limitation lies in how to use neural networks to extract useful representations for each unit or segment in the input sequence. In this paper, we propose an attention segmental recurrent neural network (ASRNN) that relies on a hierarchical attention neural semi-Markov conditional random field (semi-CRF) model for the task of sequence labeling. Our model uses a hierarchical structure to incorporate character-level and word-level information and applies an attention mechanism at both levels. This enables our method to distinguish more important information from less important information when constructing the segmental representation. We evaluated our model on three sequence labeling tasks: named entity recognition (NER), chunking, and reference parsing. Experimental results show that the proposed model benefits from the hierarchical structure and achieves competitive and robust performance on all three sequence labeling tasks.
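
The idea of attending at both the character level and the word level when building a segment representation can be illustrated with a minimal sketch. This is not the ASRNN architecture itself: the abstract does not specify the scoring function, so dot-product scoring against fixed query vectors is an assumption here, the recurrent encoders and the semi-CRF layer are omitted, and all names (`attention_pool`, `segment_representation`, the toy vectors) are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(vectors, query):
    """Return the attention-weighted average of `vectors` and the weights.

    Scores are dot products with `query` (an assumption for this sketch).
    """
    scores = [sum(a * b for a, b in zip(v, query)) for v in vectors]
    weights = softmax(scores)
    dim = len(vectors[0])
    pooled = [sum(w * v[d] for w, v in zip(weights, vectors)) for d in range(dim)]
    return pooled, weights


def segment_representation(char_vectors_per_word, start, end, char_query, word_query):
    """Build a vector for the half-open word span [start, end).

    Level 1: pool each word's character vectors into a word vector.
    Level 2: pool the word vectors of the span into a segment vector.
    """
    word_vectors = [
        attention_pool(chars, char_query)[0]
        for chars in char_vectors_per_word[start:end]
    ]
    return attention_pool(word_vectors, word_query)


# Toy input: three words, each a list of 2-dimensional character vectors.
toy_chars = [
    [[1.0, 0.0], [1.0, 0.0]],  # word 0
    [[0.0, 1.0]],              # word 1
    [[1.0, 0.0], [0.0, 1.0]],  # word 2
]
segment_vector, word_weights = segment_representation(
    toy_chars, 0, 3, char_query=[1.0, 0.0], word_query=[1.0, 0.0]
)
```

In this toy run, words whose pooled vectors align more closely with the word-level query receive larger weights, which is the sense in which attention separates more important from less important units inside a segment.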

Journal: Knowledge-Based Systems
Publication Date: Nov 6, 2020
Citations: 171

Similar Papers

Enhanced sequence labeling based on latent variable conditional random fields
Jerry Chun-Wei Lin ... Unil Yun
Neurocomputing | VOL. 403
08 May 2020

End to End Parts of Speech Tagging and Named Entity Recognition in Bangla Language
Jillur Rahman Saurav ... Farida Chowdhury
01 Sep 2019

Label Attention Network for Structured Prediction
Leyang Cui ... Yafu Li
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 30
01 Jan 2021

Fine-grained acronym expansion identification using latent-state neural structured prediction model
Jie Liu ... Yalou Huang
01 Jul 2015
