Abstract

Transformer architectures have become the core component of many state-of-the-art methods for natural language processing tasks, such as Named Entity Recognition and Relation Extraction (NER+RE). Because these architectures rely on the semantic (contextual) aspects of word sequences, they may fail to accurately identify and delimit entity spans when there is little semantic context surrounding the named entities. This is the case for entities composed only of digits and punctuation, such as IDs and phone numbers, as well as for long compound names. In this article, we propose new pre- and post-processing techniques for contextual reinforcement and entity delimitation that provide a richer semantic context, improving SpERT, a state-of-the-art Span-based Entity and Relation Transformer. To provide further context to the NER+RE training process, we also propose a data augmentation technique based on Generative Pretrained Transformers (GPT). We evaluate our strategies using real data from public administration documents (official gazettes and biddings) and court lawsuits. Our results show that our pre- and post-processing strategies, when used jointly, allow significant improvements in NER+RE effectiveness, and we also show the benefits of using GPT for training data augmentation.
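To make the augmentation idea concrete, here is a minimal, hypothetical sketch of GPT-based training-data augmentation for NER: a generative model is prompted to produce synthetic sentences that embed a known entity span, giving the recognizer more semantic context around low-context entities such as IDs and phone numbers. The model choice (gpt2 via the Hugging Face transformers library), prompt format, and helper names are illustrative assumptions, not the paper's actual setup.

```python
# Hypothetical sketch: GPT-based data augmentation for NER training.
# Assumes the Hugging Face `transformers` library; the model, prompt
# template, and annotation format are illustrative, not the authors' own.
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")

def augment_sentence(entity_text: str, entity_type: str, n: int = 3):
    """Generate synthetic sentences embedding a known entity span,
    returning (sentence, (start, end, type)) pairs for NER training."""
    prompt = f"The {entity_type} {entity_text} appears in the document:"
    outputs = generator(
        prompt,
        max_new_tokens=30,
        num_return_sequences=n,
        do_sample=True,
    )
    samples = []
    for out in outputs:
        text = out["generated_text"]
        start = text.find(entity_text)  # locate the span to annotate it
        if start != -1:
            span = (start, start + len(entity_text), entity_type)
            samples.append((text, span))
    return samples

# Example: augment a digit-only entity that has little surrounding context.
for sentence, span in augment_sentence("0345-221-9987", "phone number"):
    print(sentence, span)
```

Seeding the prompt with the entity and its type keeps the gold span intact in every generated sample, so the synthetic sentences can be added to the training set with annotations derived mechanically rather than by hand.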
