BioRel: towards large-scale biomedical relation extraction

Rui Xing, Tengwei Song, Jie Luo

doi:10.1186/s12859-020-03889-5

Abstract

Background: Although biomedical publications and literature are growing rapidly, structured knowledge that can be easily processed by computer programs is still lacking. To extract such knowledge from plain text and transform it into structured form, relation extraction has become an important problem. Datasets play a critical role in the development of relation extraction methods. However, existing relation extraction datasets in the biomedical domain are mainly human-annotated, and their scale is usually limited because annotation is labor-intensive and time-consuming.

Results: We construct BioRel, a large-scale dataset for biomedical relation extraction, using the Unified Medical Language System (UMLS) as the knowledge base and Medline as the corpus. We first identify mentions of entities in Medline sentences and link them to the UMLS with MetaMap. Then, we assign each sentence a relation label by distant supervision. Finally, we adapt state-of-the-art deep learning and statistical machine learning methods as baseline models and conduct comprehensive experiments on the BioRel dataset.

Conclusions: Based on extensive experimental results, we show that BioRel is a suitable large-scale dataset for biomedical relation extraction, providing both reasonable baseline performance and many remaining challenges for deep learning and statistical methods.
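The distant-supervision labeling step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes entity linking has already produced a pair of concept identifiers per sentence, and the knowledge-base entries, identifiers, relation name, and the `NA` fallback label are all hypothetical placeholders chosen for illustration.

```python
from typing import Dict, List, NamedTuple, Tuple


class Mention(NamedTuple):
    """A sentence with two already-linked entities (linking step assumed done)."""
    sentence: str
    head_id: str  # placeholder concept identifier of the first entity
    tail_id: str  # placeholder concept identifier of the second entity


# Hypothetical toy knowledge base: (head, tail) -> relation name.
# Identifiers and the relation name are made up for this sketch.
TOY_KB: Dict[Tuple[str, str], str] = {
    ("C_ASPIRIN", "C_HEADACHE"): "may_treat",
}

# Assumed fallback label for entity pairs with no relation in the knowledge base.
NA = "NA"


def label_sentences(
    mentions: List[Mention], kb: Dict[Tuple[str, str], str]
) -> List[Tuple[str, str, str, str]]:
    """Assign each sentence the relation its entity pair has in the knowledge base.

    This is the core distant-supervision assumption: if a pair is related in the
    knowledge base, every sentence mentioning that pair is labeled with that
    relation, which is why the resulting labels can be noisy.
    """
    labeled = []
    for m in mentions:
        relation = kb.get((m.head_id, m.tail_id), NA)
        labeled.append((m.sentence, m.head_id, m.tail_id, relation))
    return labeled


examples = [
    Mention("Aspirin relieved the headache within an hour.", "C_ASPIRIN", "C_HEADACHE"),
    Mention("Aspirin was stored next to the insulin.", "C_ASPIRIN", "C_INSULIN"),
]
labeled = label_sentences(examples, TOY_KB)
for sentence, head, tail, relation in labeled:
    print(f"{relation}\t{head}\t{tail}\t{sentence}")
```

Because labels come from a lookup rather than from reading the sentence, a sentence that merely co-mentions a related pair still receives the relation label; this noise is one reason a distantly supervised dataset leaves challenges open for downstream models.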

Highlights

Although biomedical publications and literature are growing rapidly, structured knowledge that can be processed by computer programs is still lacking
All the biomedical knowledge in these publications is expressed as unstructured text, which cannot be used directly by computer programs
The first three datasets are used for general-purpose relation extraction and the remaining ones for the biomedical domain

Summary

Introduction

Although biomedical publications and literature are growing rapidly, structured knowledge that can be processed by computer programs is still lacking. To extract such knowledge from plain text and transform it into structured form, relation extraction has become an important problem. Datasets play a critical role in the development of relation extraction methods. Existing relation extraction datasets in the biomedical domain are mainly human-annotated, and their scale is usually limited because annotation is labor-intensive and time-consuming.

Journal: BMC Bioinformatics
Publication Date: Dec 1, 2020
Citations: 19
License type: open-access

Similar Papers

BioRel: A Large-Scale Dataset for Biomedical Relation Extraction
Rui Xing ... Tengwei Song
01 Nov 2019

A hybrid approach toward biomedical relation extraction training corpora: combining distant supervision with crowdsourcing.
Diana Sousa ... Francisco M Couto
Database: the journal of biological databases and curation | VOL. 2020
01 Dec 2020

Distantly supervised biomedical relation extraction using piecewise attentive convolutional neural network and reinforcement learning
Tiantian Zhu ... Yang Xiang
Journal of the American Medical Informatics Association | VOL. 28
15 Sep 2021

Integrating deep learning architectures for enhanced biomedical relation extraction: a pipeline approach.
M Janina Sarol ... Halil Kilicoglu
Database: the journal of biological databases and curation | VOL. 2024
28 Aug 2024
