Abstract

Named Entity Recognition (NER), whose outputs are often consumed as linguistic features alongside Part-Of-Speech (POS) tagging, is a major task in Natural Language Processing (NLP). In this paper, we put forward a new comprehensive-embedding that considers three aspects, namely character-embedding, word-embedding, and position-embedding, concatenated in the given order so that their dependencies are captured. Based on it, we propose a new Character–Word–Position Combined BiLSTM-Attention (CWPC_BiAtt) model for the Chinese NER task. The comprehensive-embedding is passed through a Bidirectional Long Short-Term Memory (BiLSTM) layer to capture the connection between historical and future information, and an attention mechanism is then employed to capture the connection between the content of the sentence at the current position and that at any other position. Finally, we utilize a Conditional Random Field (CRF) to decode the entire tagging sequence. Experiments show that the proposed CWPC_BiAtt model is well qualified for the NER task on the Microsoft Research Asia (MSRA) dataset and the Weibo NER corpus. High precision and recall were obtained, which verifies the stability of the model. The position-embedding in the comprehensive-embedding compensates for the attention mechanism's lack of position information for unordered sequences, which shows that the comprehensive-embedding is complete. Taken as a whole, the proposed CWPC_BiAtt model has three distinct characteristics: completeness, simplicity, and stability. It achieved the highest F-score, reaching state-of-the-art performance on the MSRA dataset and the Weibo NER corpus.
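To make the pipeline described above concrete, here is a minimal sketch in PyTorch. It is not the authors' code: the class and parameter names are ours, the dimensions are illustrative, the word feature is assumed to be aligned per character position, and the CRF is represented only by its linear emission layer (a CRF layer would consume these emissions to decode the best tag sequence).

    import torch
    import torch.nn as nn

    class CWPC_BiAtt(nn.Module):
        """Sketch: comprehensive-embedding -> BiLSTM -> self-attention -> tag emissions."""
        def __init__(self, n_chars, n_words, max_len, n_tags,
                     char_dim=64, word_dim=100, pos_dim=32, hidden=128):
            super().__init__()
            # Comprehensive-embedding: character, word, and position embeddings,
            # concatenated in that fixed order.
            self.char_emb = nn.Embedding(n_chars, char_dim)
            self.word_emb = nn.Embedding(n_words, word_dim)
            self.pos_emb = nn.Embedding(max_len, pos_dim)
            emb_dim = char_dim + word_dim + pos_dim
            # BiLSTM links historical (forward) and future (backward) context.
            self.bilstm = nn.LSTM(emb_dim, hidden, batch_first=True, bidirectional=True)
            # Self-attention relates the current position to every other position.
            self.attn = nn.MultiheadAttention(2 * hidden, num_heads=4, batch_first=True)
            # Per-tag emission scores; a CRF layer would decode the final sequence.
            self.emit = nn.Linear(2 * hidden, n_tags)

        def forward(self, chars, words):
            # chars, words: (batch, seq_len) index tensors, aligned per character.
            positions = torch.arange(chars.size(1), device=chars.device).unsqueeze(0)
            x = torch.cat([self.char_emb(chars),
                           self.word_emb(words),
                           self.pos_emb(positions).expand(chars.size(0), -1, -1)], dim=-1)
            h, _ = self.bilstm(x)
            h, _ = self.attn(h, h, h)   # global self-attention over the sentence
            return self.emit(h)          # emissions to be decoded by a CRF

Note that the explicit position-embedding is what supplies order information to the (otherwise order-agnostic) attention layer, matching the completeness claim in the abstract.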

Highlights

  • Named Entity Recognition (NER) plays an important role in the field of natural language processing

  • If a positive instance is wrongly predicted as negative, it is counted as a false negative (FN); the recall rate is then defined as Recall = TP / (TP + FN), where TP denotes true positives (see the sketch after this list)

  • We found that [22,23] used traditional machine learning for NER tasks, which requires complex feature engineering, and that training results degrade when fewer features are available
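To make the metric definition in the highlight above concrete, here is a minimal sketch (plain Python; the example counts are made up for illustration) computing precision, recall, and F-score from TP/FP/FN counts:

    def precision_recall_f1(tp, fp, fn):
        """Standard metrics from true-positive, false-positive, false-negative counts."""
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
        return precision, recall, f1

    # Hypothetical counts: 90 entities found correctly, 10 spurious, 15 missed.
    print(precision_recall_f1(90, 10, 15))  # -> (0.9, ~0.857, ~0.878)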


Summary

Introduction

Named Entity Recognition (NER) plays an important role in the field of natural language processing. In recent years, it has gradually become an essential component of information extraction technologies [1]. Long Short-Term Memory (LSTM), proposed in 1997 by [18], has achieved unprecedented performance in NLP in recent years. In the NER task, the Bidirectional Long Short-Term Memory-Convolutional Neural Networks (BiLSTM-CNNs) model of [19] achieved a strong F-score of 91.23% on the CoNLL-2003 dataset. Taking the above considerations into account, we propose a new model, Character-Word-Position Combined BiLSTM-Attention (CWPC_BiAtt), for Chinese NER.

Related Work
Attention Mechanism
Comprehensive-Embedding
BiLSTM Layer
Attention Layer
Global Self-Attention
Local Self-Attention
CRF Layer
Evaluation Metrics
MSRA Dataset
Weibo NER Corpus
Settings
Experiment and Analysis on MSRA Dataset
Experiment and Analysis on Weibo NER Corpus
Findings
Conclusions