E2EET: from pipeline to end-to-end entity typing via transformer-based embeddings

Michael Stewart,Wei Liu

doi:10.1007/s10115-021-01626-9

Abstract

Entity typing (ET) is the process of identifying the semantic types of every entity within a corpus. ET involves labelling each entity mention with one or more class labels. As a multi-class, multi-label task, it is considerably more challenging than named entity recognition. This means existing entity typing models require pre-identified mentions and cannot operate directly on plain text. Pipeline-based approaches are therefore used to join a mention extraction model and an entity typing model to process raw text. Another key limiting factor is that these mention-level ET models are trained on fixed context windows, which makes the entity typing results sensitive to window size selection. In light of these drawbacks, we propose an end-to-end entity typing model (E2EET) using a Bi-GRU to remove the dependency on window size. To demonstrate the effectiveness of our E2EET model, we created a stronger baseline mention-level model by incorporating the latest contextualised transformer-based embeddings (BERT). Extensive ablative studies demonstrate the competitiveness and simplicity of our end-to-end model for entity typing.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

E2EET: from pipeline to end-to-end entity typing via transformer-based embeddings

Abstract

Talk to us

Similar Papers

More From: Knowledge and Information Systems

Lead the way for us

Similar Papers

An End-To-End NER Model with Explicit Boundary and Type Information
Ying Feng ... Zhe Chen
Journal of Physics: Conference Series | VOL. 2337
Ying Feng, et. al.Ying Feng ... Zhe Chen
01 Sep 2022
Journal of Physics: Conference Series | VOL. 2337

Terminologies augmented recurrent neural network model for clinical named entity recognition.
Ivan Lerner ... Xavier Tannier
Journal of Biomedical Informatics | VOL. 102
Ivan Lerner, et. al.Ivan Lerner ... Xavier Tannier
16 Dec 2019
Journal of Biomedical Informatics | VOL. 102

Research on Chinese medical named entity recognition based on collaborative cooperation of multiple neural network models.
Bin Ji ... Qingbo Wu
Journal of Biomedical Informatics | VOL. 104
Bin Ji, et. al.Bin Ji ... Qingbo Wu
25 Feb 2020
Journal of Biomedical Informatics | VOL. 104

TaggerOne: joint named entity recognition and normalization with semi-Markov Models.
Robert Leaman ... Zhiyong Lu
Bioinformatics | VOL. 32
Robert Leaman, et. al.Robert Leaman ... Zhiyong Lu
09 Jun 2016
Bioinformatics | VOL. 32

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

E2EET: from pipeline to end-to-end entity typing via transformer-based embeddings

Abstract

Talk to us

Similar Papers

More From: Knowledge and Information Systems