Performance Of Relation Extraction Research Articles

Keyword extraction from Knowledge Bases underpins the definition of relevancy in Digital Library search systems. However, it is the pertinent task of Joint Relation Extraction, which populates the Knowledge Bases from which results are retrieved. Recent work focuses on fine-tuned, Pre-trained Transformers. Yet, F1 scores for scientific literature achieve just 53.2, versus 69 in the general domain. The research demonstrates the failure of existing work to evidence the rationale for optimisations to finetuned classifiers. In contrast, emerging research subjectively adopts the common belief that Natural Language Processing techniques fail to derive context and shared knowledge. In fact, global context and shared knowledge account for just 10.4% and 11.2% of total relation misclassifications, respectively. In this work, the novel employment of semantic text analysis presents objective challenges for the Transformer-based classification of Joint Relation Extraction. This is the first known work to quantify that pipelined error propagation accounts for 45.3% of total relation misclassifications, the most poignant challenge in this domain. More specifically, Part-of-Speech tagging highlights the misclassification of complex noun phrases, accounting for 25.47% of relation misclassifications. Furthermore, this study identifies two limitations in the purported bidirectionality of the Bidirectional Encoder Representations from Transformers (BERT) Pre-trained Language Model. Firstly, there is a notable imbalance in the misclassification of right-to-left relations, which occurs at a rate double that of left-to-right relations. Additionally, a failure to recognise local context through determiners and prepositions contributes to 16.04% of misclassifications. Furthermore, it is highlighted that the annotation scheme of the singular dataset utilised in existing research, Scientific Entities, Relations and Coreferences (SciERC), is marred by ambiguity. Notably, two asymmetric relations within this dataset achieve recall rates of only 10% and 29%.

Read full abstract

The relationship between biomedical entities is complex, and many of them have not yet been identified. For many biomedical research areas including drug discovery, it is of paramount importance to identify the relationships that have already been established through a comprehensive literature survey. However, manually searching through literature is difficult as the amount of biomedical publications continues to increase. Therefore, the relation classification task, which automatically mines meaningful relations from the literature, is spotlighted in the field of biomedical text mining. By applying relation classification techniques to the accumulated biomedical literature, existing semantic relations between biomedical entities that can help to infer previously unknown relationships are efficiently grasped. To develop semantic relation classification models, which is a type of supervised machine learning, it is essential to construct a training dataset that is manually annotated by biomedical experts with semantic relations among biomedical entities. Any advanced model must be trained on a dataset with reliable quality and meaningful scale to be deployed in the real world and can assist biologists in their research. In addition, as the number of such public datasets increases, the performance of machine learning algorithms can be accurately revealed and compared by using those datasets as a benchmark for model development and improvement. In this paper, we aim to build such a dataset. Along with that, to validate the usability of the dataset as training data for relation classification models and to improve the performance of the relation extraction task, we built a relation classification model based on Bidirectional Encoder Representations from Transformers (BERT) trained on our dataset, applying our newly proposed fine-tuning methodology. In experiments comparing performance among several models based on different deep learning algorithms, our model with the proposed fine-tuning methodology showed the best performance. The experimental results show that the constructed training dataset is an important information resource for the development and evaluation of semantic relation extraction models. Furthermore, relation extraction performance can be improved by integrating our proposed fine-tuning methodology. Therefore, this can lead to the promotion of future text mining research in the biomedical field.

Read full abstract

Performance Of Relation Extraction Research Articles

Related Topics

Articles published on Performance Of Relation Extraction

Biomedical relation extraction method based on ensemble learning and attention mechanism

Integrating AI with medical industry chain data: enhancing clinical nutrition research through semantic knowledge graphs.

A task‐centric knowledge graph construction method based on multi‐modal representation learning for industrial maintenance automation

Multimodal learning for temporal relation extraction in clinical texts.

Ensemble pretrained language models to extract biomedical knowledge from literature.

Leveraging Semantic Text Analysis to Improve the Performance of Transformer-Based Relation Extraction

A Study of Entity Relationship Extraction Algorithms Based on Symmetric Interaction between Data, Models, and Inference Algorithms

Influence of Context in Transformer-Based Medication Relation Extraction.

Document-level relation extraction with three channels

Interactive Lexical and Semantic Graphs for Semisupervised Relation Extraction.

Knowledge-enhanced event relation extraction via event ontology prompt

Clinical concept and relation extraction using prompt-based machine reading comprehension.

REEGAT: RoBERTa Entity Embedding and Graph Attention Networks Enhanced Sentence Representation for Relation Extraction

Multi-task learning for few-shot biomedical relation extraction

Document-level relation extraction with multi-layer heterogeneous graph attention network

Transfer Learning for Low-Resource Multilingual Relation Classification

Joint learning-based causal relation extraction from biomedical literature

Deep learning-based relation extraction and knowledge graph-based representation of construction safety requirements

Relation Extraction in Biomedical Texts Based on Multi-Head Attention Model With Syntactic Dependency Feature: Modeling Study.

BertSRC: transformer-based semantic relation classification

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Performance Of Relation Extraction Research Articles

Related Topics

Articles published on Performance Of Relation Extraction

Biomedical relation extraction method based on ensemble learning and attention mechanism

Integrating AI with medical industry chain data: enhancing clinical nutrition research through semantic knowledge graphs.

A task‐centric knowledge graph construction method based on multi‐modal representation learning for industrial maintenance automation

Multimodal learning for temporal relation extraction in clinical texts.

Ensemble pretrained language models to extract biomedical knowledge from literature.

Leveraging Semantic Text Analysis to Improve the Performance of Transformer-Based Relation Extraction

A Study of Entity Relationship Extraction Algorithms Based on Symmetric Interaction between Data, Models, and Inference Algorithms

Influence of Context in Transformer-Based Medication Relation Extraction.

Document-level relation extraction with three channels

Interactive Lexical and Semantic Graphs for Semisupervised Relation Extraction.

Knowledge-enhanced event relation extraction via event ontology prompt

Clinical concept and relation extraction using prompt-based machine reading comprehension.

REEGAT: RoBERTa Entity Embedding and Graph Attention Networks Enhanced Sentence Representation for Relation Extraction

Multi-task learning for few-shot biomedical relation extraction

Document-level relation extraction with multi-layer heterogeneous graph attention network

Transfer Learning for Low-Resource Multilingual Relation Classification

Joint learning-based causal relation extraction from biomedical literature

Deep learning-based relation extraction and knowledge graph-based representation of construction safety requirements

Relation Extraction in Biomedical Texts Based on Multi-Head Attention Model With Syntactic Dependency Feature: Modeling Study.

BertSRC: transformer-based semantic relation classification