Abstract

Natural language processing comprises techniques for processing language data, such as text and speech. Sequence labeling is an upstream task for many natural language processing applications, such as machine translation, text classification, and sentiment classification. This paper focuses on the sequence labeling task, in which semantic labels are assigned to each unit of a given input sequence. Two frameworks of latent variable conditional random fields (CRF) models (called LVCRF-I and LVCRF-II) are proposed, which use the encoding schema as a latent variable to capture the latent structure between the hidden variables and the observed data. Of the two designed models, LVCRF-I operates at the sentence level while LVCRF-II operates at the word level, automatically choosing the best encoding schema for a given input sequence without handcrafted features. In the experiments, the two proposed models are verified on four sequence prediction tasks: named entity recognition (NER), chunking, reference parsing, and POS tagging. The proposed frameworks achieve better performance than the conventional CRF model without using additional handcrafted features. Moreover, these frameworks can serve as drop-in substitutes for conventional CRF models: in the commonly used LSTM-CRF models, the CRF layer can be replaced with our proposed framework, as they share the same training and inference procedures. The experimental results show that the proposed models exploit the latent encoding-schema variable and provide competitive and robust performance on all four sequence prediction tasks.

Highlights

  • Sequence labeling is often the first step in text data processing

  • To explain the models clearly, we briefly introduce the conventional conditional random fields (CRF) model, present the proposed latent variable CRF models, and highlight the main differences between them

  • We propose a framework of the CRF that uses the encoding schema as a latent variable


Summary

Introduction

Sequence labeling is the task of identifying and assigning a semantic label to each unit or subsequence of an input sequence. For example, a single-word person name is marked with B in the BIO encoding schema, because B represents the beginning of a person entity, but it is marked with U in the BILOU encoding schema, wherein U denotes a unit-length person entity. Different encoding schemas can lead to different performance across models and sequence-labeling tasks. In this paper, two latent variable CRFs are proposed that can automatically choose the best encoding schema for a given input sentence. The two proposed models incorporate the encoding schemas as a latent variable in the conventional CRF in two different ways. The performance of the proposed latent variable models is much better than that of the conventional CRF with the BIO or BILOU encoding schema.
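The difference between the two encoding schemas can be illustrated with a short sketch (not from the paper; the tokens, entity spans, and helper functions below are hypothetical examples): the same entity spans are rendered as tag sequences under BIO and under BILOU, and a single-token entity receives B in the former but U in the latter.

```python
# Hypothetical illustration of BIO vs BILOU encoding of the same entity spans.

def encode_bio(tokens, spans):
    """spans: list of (start, end_exclusive, type) entity spans."""
    tags = ["O"] * len(tokens)
    for start, end, etype in spans:
        tags[start] = f"B-{etype}"          # B marks the beginning
        for i in range(start + 1, end):
            tags[i] = f"I-{etype}"          # I marks the inside
    return tags

def encode_bilou(tokens, spans):
    tags = ["O"] * len(tokens)
    for start, end, etype in spans:
        if end - start == 1:
            tags[start] = f"U-{etype}"      # U marks a unit-length entity
        else:
            tags[start] = f"B-{etype}"
            for i in range(start + 1, end - 1):
                tags[i] = f"I-{etype}"
            tags[end - 1] = f"L-{etype}"    # L marks the last token
    return tags

tokens = ["John", "visited", "New", "York"]
spans = [(0, 1, "PER"), (2, 4, "LOC")]
print(encode_bio(tokens, spans))    # ['B-PER', 'O', 'B-LOC', 'I-LOC']
print(encode_bilou(tokens, spans))  # ['U-PER', 'O', 'B-LOC', 'L-LOC']
```

Note how the single-token entity "John" is tagged B-PER under BIO but U-PER under BILOU, which is exactly the distinction the latent variable in the proposed models is meant to arbitrate.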

Literature review
Latent variable CRF
Encoding Schema
Problem Statement
Proposed latent variable CRF models
Conventional CRF
p(y | x) = exp{W · f(x, y)} / Z(x)
Latent Variable CRF-I
Latent variable CRF-II
Features
Experimental evaluation
Datasets
Named entity recognition
Reference parsing
Chunking
POS tagging
Conclusion
Declaration of Competing Interest