Abstract
Using the word as the basic unit may undermine a Chinese event detection model's performance because of inaccurate word boundaries generated by segmentation tools. Moreover, word embeddings are context-independent and cannot handle the polysemy of event triggers, which may prevent the model from reaching the desired performance. To address these issues, we propose a BiLSTM-CRF (Bidirectional Long Short-Term Memory Conditional Random Field) model using contextualized representations, which regards the event detection task as a character-level sequence labeling problem and uses contextualized representations to disambiguate event triggers. Experiments show that our proposed method sets a new state of the art, which demonstrates that Chinese characters can replace words for the Chinese event detection task. In addition, using contextualized representations reduces false positives, which verifies that this kind of representation can remedy the weakness of the word embedding technique. Based on these results, we believe that character-level models are worth exploring in the future.
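The abstract casts event detection as character-level sequence labeling. A minimal sketch of what that framing looks like in practice, assuming a standard BIO tagging scheme (the helper name `bio_tags`, the example sentence, and the "Birth" event type are invented for illustration):

```python
def bio_tags(sentence, triggers):
    """Label each character: B-<type> starts a trigger span,
    I-<type> continues it, O marks non-trigger characters.
    Trigger spans are (start, end, event_type) with end exclusive."""
    tags = ["O"] * len(sentence)
    for start, end, etype in triggers:
        tags[start] = "B-" + etype
        for i in range(start + 1, end):
            tags[i] = "I-" + etype
    return tags

# Toy example: "他在北京出生" ("He was born in Beijing"),
# with the trigger "出生" at characters 4-5, invented type "Birth".
print(bio_tags("他在北京出生", [(4, 6, "Birth")]))
# -> ['O', 'O', 'O', 'O', 'B-Birth', 'I-Birth']
```

Operating on characters this way sidesteps the segmentation errors the abstract mentions, since no word boundaries are needed to assign labels.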
Highlights
Event Extraction is a basic task in the information extraction field, proposed by the Automatic Content Extraction (ACE) program [1], and has broad application prospects
To address this shortage of word embeddings, researchers have proposed contextualized word embedding techniques that learn representations from context, such as Context2Vec [10], ELMo [13], and BERT [14]. We explore the following two ideas for incorporating contextualized representations into the BiLSTM-CRF model: 1) We concatenate character embeddings and contextualized representations
3) We find that, compared with the model without contextualized representations, the model using these representations reduces false positives; however, neither character embeddings nor contextualized representations help the model detect unseen triggers
Summary
Event Extraction is a basic task in the information extraction field, proposed by the Automatic Content Extraction (ACE) program. A word embedding is constant across different contexts and cannot help the model discriminate between different meanings of a word [9] [10]. This shortage hinders performance gains of current Chinese ED neural models because the same event trigger may express different event types. To address this shortage of word embeddings, researchers have proposed contextualized word embedding techniques that learn representations from context, such as Context2Vec [10], ELMo [13], and BERT [14]. We explore the following two ideas for incorporating contextualized representations into the BiLSTM-CRF model: 1) We concatenate character embeddings and contextualized representations. Our contributions are as follows: 1) We propose a BiLSTM-CRF model for the event detection task, which incorporates contextualized representations. We will use the term ‘‘representation’’ interchangeably with the term ‘‘embedding’’ in the remainder of the paper
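The CRF layer of a BiLSTM-CRF chooses the whole label sequence jointly rather than labeling each character independently; at inference time this is the standard Viterbi algorithm over emission and transition scores. A pure-Python sketch with invented toy numbers (in the actual model, emissions come from the BiLSTM and transitions are learned parameters; the function name `viterbi_decode` is ours):

```python
def viterbi_decode(emissions, transitions):
    """Return the highest-scoring label sequence, as a CRF does at
    inference. emissions[t][y] scores label y at character t;
    transitions[p][y] scores moving from label p to label y."""
    n_labels = len(emissions[0])
    score = list(emissions[0])   # best path score ending in each label
    backptr = []                 # backpointers for path recovery
    for t in range(1, len(emissions)):
        new_score, ptrs = [], []
        for y in range(n_labels):
            best_prev = max(range(n_labels),
                            key=lambda p: score[p] + transitions[p][y])
            new_score.append(score[best_prev] + transitions[best_prev][y]
                             + emissions[t][y])
            ptrs.append(best_prev)
        score, backptr = new_score, backptr + [ptrs]
    # Backtrack from the best final label.
    y = max(range(n_labels), key=lambda l: score[l])
    path = [y]
    for ptrs in reversed(backptr):
        y = ptrs[y]
        path.append(y)
    return list(reversed(path))

# Toy run: 3 characters, 2 labels (say 0 = O, 1 = trigger).
print(viterbi_decode([[2, 0], [0, 3], [1, 0]],
                     [[0, -1], [-1, 0]]))
# -> [0, 1, 0]
```

The transition table is what lets the CRF enforce constraints such as "I- must follow B-" that per-character classification cannot express.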