BiodivNERE: Gold standard corpora for named entity recognition and relation extraction in the biodiversity domain

Nora Abdelmageed, Leila Feddoul, Anahita Kazem, Birgitta König-Ries, Sheeba Samuel, Alsayed Algergawy, Felicitas Löffler, Jitendra Gaikwad

doi:10.3897/bdj.10.e89481

Abstract

Biodiversity is the assortment of life on Earth, covering evolutionary, ecological, biological, and social forms. To preserve life in all its variety and richness, it is imperative to monitor the current state of biodiversity and its change over time and to understand the forces driving it. This need has resulted in numerous works being published in this field, generating a large amount of textual data (publications) and metadata (e.g. dataset descriptions). To support the management and analysis of these data, two techniques from computer science are of interest, namely Named Entity Recognition (NER) and Relation Extraction (RE). While the former enables better content discovery and understanding, the latter fosters analysis by detecting connections between entities and thus allows us to draw conclusions and answer relevant domain-specific questions. Entities and their relations can be predicted automatically with machine learning or deep learning techniques, whose training and evaluation require labelled corpora. In this paper, we present two gold-standard corpora for NER and RE, generated from biodiversity dataset metadata and abstracts, that can be used as evaluation benchmarks for the development of new computer-supported tools that rely on machine learning or deep learning techniques. These corpora are manually labelled and verified by biodiversity experts. In addition, we explain the detailed steps of constructing these datasets and present the underlying ontology for the classes and relations used to annotate the corpora.
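
As a rough illustration of what labelled data for NER and RE can look like, the following minimal sketch stores entities as token spans, converts them to BIO tags (a common NER encoding), and links two entities with a relation. The sentence, the labels `ORGANISM` and `ENVIRONMENT`, the relation name `occur_in`, and all class and function names are assumptions made for this example; they are not taken from the corpora or their annotation scheme.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Entity:
    start: int  # index of the first token (inclusive)
    end: int    # index just past the last token (exclusive)
    label: str  # hypothetical entity class


@dataclass(frozen=True)
class Relation:
    head: Entity
    tail: Entity
    label: str  # hypothetical relation type


def to_bio(tokens: List[str], entities: List[Entity]) -> List[str]:
    """Encode non-overlapping entity spans as one BIO tag per token."""
    tags = ["O"] * len(tokens)
    for ent in entities:
        tags[ent.start] = f"B-{ent.label}"
        for i in range(ent.start + 1, ent.end):
            tags[i] = f"I-{ent.label}"
    return tags


# Hypothetical example sentence, tokenised on whitespace.
tokens = "Brown trout live in cold mountain streams".split()

organism = Entity(0, 2, "ORGANISM")    # "Brown trout"
habitat = Entity(5, 7, "ENVIRONMENT")  # "mountain streams"
relation = Relation(organism, habitat, "occur_in")

bio_tags = to_bio(tokens, [organism, habitat])
for token, tag in zip(tokens, bio_tags):
    print(f"{token}\t{tag}")
```

In this sketch the NER part is the per-token tag sequence, while the RE part is the labelled link between two already-recognised entities; a gold-standard corpus would contain many such sentences with expert-verified labels.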

Journal: Biodiversity Data Journal	Publication Date: Oct 7, 2022
Citations: 4	License type: CC BY 4.0

Similar Papers

BiodiViz: Leveraging NER and RE for Automated Knowledge Graph Generation in Biodiversity Research
Angela Shannen Tan ... Roselyn Gabud
Biodiversity Information Science and Standards | VOL. 8
29 Oct 2024

A Trigger-Sense Memory Flow Framework for Joint Entity and Relation Extraction
Yongliang Shen ... Weiming Lu
19 Apr 2021

Integrated Extraction of Entities and Relations via Attentive Graph Convolutional Networks
Chuhan Gao ... Yueting Meng
Electronics | VOL. 13
08 Nov 2024

People Summarization by Combining Named Entity Recognition and Relation Extraction
Xiaojiang Liu ... Nenghai Yu
Journal of Convergence Information Technology | VOL. 5
31 Dec 2010
