Abstract

We propose a sequence labeling framework with a secondary training objective, learning to predict surrounding words for every word in the dataset. This language modeling objective incentivises the system to learn general-purpose patterns of semantic and syntactic composition, which are also useful for improving accuracy on different sequence labeling tasks. The architecture was evaluated on a range of datasets, covering the tasks of error detection in learner texts, named entity recognition, chunking and POS-tagging. The novel language modeling objective provided consistent performance improvements on every benchmark, without requiring any additional annotated or unannotated data.
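One way to formalise the combined objective described above (notation ours; the summary itself gives no equations) is a labeling loss augmented with forward and backward language modeling terms, weighted by the γ that appears in the highlights below:

    \tilde{E} = E_{\text{label}} + \gamma\,(\overrightarrow{E} + \overleftarrow{E}), \qquad
    \overrightarrow{E} = -\sum_t \log P(w_{t+1} \mid \overrightarrow{h}_t), \quad
    \overleftarrow{E} = -\sum_t \log P(w_{t-1} \mid \overleftarrow{h}_t)

Here the forward hidden state at step t predicts the next word and the backward state predicts the previous word, so the secondary objective needs no data beyond the existing training sequences.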

Highlights

  • Accurate and efficient sequence labeling models have a wide range of applications, including named entity recognition (NER), part-of-speech (POS) tagging, error detection and shallow parsing

  • The proposed architecture was evaluated on 10 different sequence labeling datasets, covering the tasks of error detection, NER, chunking, and POS tagging

  • We performed experiments on the development data in which the value of γ was gradually decreased, but found that a small static value performed comparably or even better (see the sketch below). These experiments indicate that the language modeling objective helps the network learn general-purpose features that are useful for sequence labeling even in the later stages of training
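
A hedged Python sketch of this weighting (function names and the decay schedule are our illustration; the text says only that γ was gradually decreased):

    # Combined objective: labeling loss plus a gamma-weighted language modeling loss.
    def combined_loss(label_loss, lm_loss, gamma=0.1):
        return label_loss + gamma * lm_loss

    # One possible schedule for the "gradually decreased" variant tried in the
    # experiments; a small static gamma was found to work comparably or better.
    def decayed_gamma(step, gamma0=0.1, decay_rate=0.99):
        return gamma0 * decay_rate ** step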

Summary

Introduction

Accurate and efficient sequence labeling models have a wide range of applications, including named entity recognition (NER), part-of-speech (POS) tagging, error detection and shallow parsing. Recent work has shown that neural network architectures are able to achieve comparable or improved performance, while automatically discovering useful features for a specific task and only requiring a sequence of tokens as input (Collobert et al., 2011; Irsoy and Cardie, 2014; Lample et al., 2016). This feature discovery is usually driven by an objective function based on predicting the annotated labels for each word, without much incentive to learn more general language features from the available text. We therefore introduce a secondary objective: learning to predict the surrounding words for every word in the sentence. This secondary unsupervised objective encourages the framework to learn richer features for semantic composition without requiring additional training data. The multitask training framework gives the largest improvements on error detection datasets, outperforming the previous state-of-the-art architecture.
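
A minimal sketch of such an architecture in PyTorch (all class, function and parameter names are ours; the paper's exact configuration may differ): a bidirectional LSTM tagger whose forward states feed a next-word prediction head and whose backward states feed a previous-word prediction head, alongside the usual label head.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class MultitaskTagger(nn.Module):
        def __init__(self, vocab_size, n_labels, emb_dim=100, hidden_dim=100):
            super().__init__()
            self.embed = nn.Embedding(vocab_size, emb_dim)
            self.bilstm = nn.LSTM(emb_dim, hidden_dim, bidirectional=True,
                                  batch_first=True)
            self.label_head = nn.Linear(2 * hidden_dim, n_labels)
            # Separate vocabulary-sized heads for the two LM directions.
            self.fw_lm_head = nn.Linear(hidden_dim, vocab_size)
            self.bw_lm_head = nn.Linear(hidden_dim, vocab_size)

        def forward(self, tokens):
            h, _ = self.bilstm(self.embed(tokens))   # (batch, time, 2 * hidden)
            h_fw, h_bw = h.chunk(2, dim=-1)          # forward / backward states
            return (self.label_head(h),              # per-token label scores
                    self.fw_lm_head(h_fw),           # predicts word t+1
                    self.bw_lm_head(h_bw))           # predicts word t-1

    def multitask_loss(model, tokens, labels, gamma=0.1):
        label_scores, fw_scores, bw_scores = model(tokens)
        e_label = F.cross_entropy(label_scores.transpose(1, 2), labels)
        # Forward LM: the state at position t predicts the token at t+1.
        e_fw = F.cross_entropy(fw_scores[:, :-1].transpose(1, 2), tokens[:, 1:])
        # Backward LM: the state at position t predicts the token at t-1.
        e_bw = F.cross_entropy(bw_scores[:, 1:].transpose(1, 2), tokens[:, :-1])
        return e_label + gamma * (e_fw + e_bw)

Note that the language modeling targets are simply the input tokens shifted by one position, which is why the secondary objective requires no additional annotated or unannotated data.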

Full text sections:

  • Neural Sequence Labeling
  • Language Modeling Objective
  • Evaluation Setup
  • Error Detection
  • NER and Chunking
  • POS tagging
  • Related Work
  • Conclusion