Abstract

Sequence labeling, such as part-of-speech (POS) tagging, named entity recognition (NER), and text chunking, is a classic task in natural language processing. Most existing neural network models for sequence labeling are based on recurrent neural networks (RNNs). Recently, convolutional neural networks (CNNs) have been proposed to replace the recurrent components for sequence labeling. However, they are usually shallow compared to the deep convolutional networks that achieve state-of-the-art performance in other fields. Due to the vanishing gradient problem, these models usually cannot work well when the number of layers is simply increased. In this paper, we propose a deep CNN architecture for sequence labeling that can capture a large context through stacked convolutions. To mitigate the vanishing gradient problem, the proposed method incorporates gated linear units, residual connections, and dense connections. Experimental results on three sequence labeling tasks show that the proposed model achieves performance competitive with the RNN-based state-of-the-art method while running $2.41\times$ faster, even with up to 10 convolutional layers.
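
To make the architectural ingredients concrete, below is a minimal sketch, assuming PyTorch, of one gated convolutional block combining a gated linear unit (GLU) with a residual connection. The class name `GatedConvBlock` and the sizes (`d_model`, kernel width, batch shape) are illustrative assumptions, not the paper's exact configuration, and the dense connections are omitted for brevity.

```python
# Sketch of one gated convolutional layer with a residual connection.
# Stacking several such blocks widens the receptive field, letting the
# model capture a large context without recurrence.
import torch
import torch.nn as nn
import torch.nn.functional as F

class GatedConvBlock(nn.Module):
    """Conv1d producing 2*d channels, halved by a GLU, plus a residual."""
    def __init__(self, d_model: int, kernel_size: int = 3):
        super().__init__()
        # 2*d_model output channels: one half carries content,
        # the other half the sigmoid gate used by the GLU.
        self.conv = nn.Conv1d(d_model, 2 * d_model,
                              kernel_size, padding=kernel_size // 2)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, d_model, seq_len)
        h = F.glu(self.conv(x), dim=1)  # GLU halves the channel dim
        return x + h                    # residual path eases gradient flow

# Usage with hypothetical sizes: 8 sentences, 128 channels, 50 tokens.
x = torch.randn(8, 128, 50)
block = GatedConvBlock(d_model=128)
print(block(x).shape)  # torch.Size([8, 128, 50])
```

Both the gate and the identity shortcut give gradients a path that bypasses the convolution's nonlinearity, which is why such blocks can be stacked to around 10 layers without the vanishing-gradient degradation the abstract describes.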
