Comparison of named entity recognition methodologies in biomedical documents

Hye-Jeong Song, Yu-Seop Kim, Jong-Dae Kim, Byeong-Cheol Jo, Chan-Young Park

doi:10.1186/s12938-018-0573-6

Open Access

https://doi.org/10.1186/s12938-018-0573-6

Journal: BioMedical Engineering OnLine	Publication Date: Nov 1, 2018
Citations: 37	License type: open-access

Affiliation: Hallym University

Abstract

Background: Biomedical named entity recognition (Bio-NER) is a fundamental task in handling biomedical text terms, such as RNA, protein, cell type, cell line, and DNA. Bio-NER is one of the most elementary and core tasks in biomedical knowledge discovery from texts. The system described here was developed on the BioNLP/NLPBA 2004 shared task. Experiments were conducted on the training and evaluation sets provided by the task organizers.

Results: Our results show that, compared with a baseline having a 70.09% F1 score, the RNN Jordan- and Elman-type algorithms have F1 scores of approximately 60.53% and 58.80%, respectively. With CRF as the machine learning algorithm, the CCA, GloVe, and Word2Vec word embeddings yield F1 scores of 72.73%, 72.74%, and 72.82%, respectively.

Conclusions: Using word embeddings constructed through unsupervised learning can save the time and cost required to construct the learning data.

Highlights

Biomedical named entity recognition (Bio-NER) is a fundamental task in handling biomedical text terms, such as RNA, protein, cell type, cell line, and DNA
Biomedical named entity recognition is very important in the language processing of biomedical texts, especially for extracting information about proteins and genes, such as RNA or DNA, from documents
We compare the performance of recurrent neural networks (RNNs) and conditional random fields (CRFs) with word embeddings

Summary

Introduction

Biomedical named entity recognition (Bio-NER) is a fundamental task in handling biomedical text terms, such as RNA, protein, cell type, cell line, and DNA. Named entity recognition (NER) assigns a named entity tag to a designated word by using rules and heuristics. Named entities, which represent a person, a location, or an organization, should be recognized [1]. NER extracts nominal and numeric information from a document and classifies each word into a category such as person, organization, or date [2]. Bio-NER is very important in the language processing of biomedical texts, especially for extracting information about proteins and genes, such as RNA or DNA, from documents. Finding named entities of genes in texts is an important and difficult task [3].
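To make the tagging task and the F1 scores quoted in the abstract concrete, the following is a minimal sketch of how per-token tags can be turned into entity spans and scored. It assumes BIO-style tags (`B-`, `I-`, `O`) over the five entity classes named above and an exact-match criterion for both span boundaries and entity type; the example sentence, its tags, and the helper names `bio_to_spans` and `f1_score` are illustrative and are not taken from the paper or from the shared task's evaluation script.

```python
from typing import List, Set, Tuple

# (start index, end index exclusive, entity type)
Span = Tuple[int, int, str]


def bio_to_spans(tags: List[str]) -> Set[Span]:
    """Collect entity spans from a BIO-tagged token sequence."""
    spans: Set[Span] = set()
    start, label = None, None
    # A trailing "O" sentinel closes an entity that ends the sentence.
    for i, tag in enumerate(tags + ["O"]):
        continues = tag.startswith("I-") and label == tag[2:]
        if continues:
            continue
        if label is not None:
            spans.add((start, i, label))
        if tag.startswith("B-") or tag.startswith("I-"):
            # Assumption: a stray I- tag is treated as the start of an entity.
            start, label = i, tag[2:]
        else:
            start, label = None, None
    return spans


def f1_score(gold: Set[Span], pred: Set[Span]) -> float:
    """Exact-match F1: a predicted span counts only if boundaries and type agree."""
    true_positives = len(gold & pred)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(pred)
    recall = true_positives / len(gold)
    return 2 * precision * recall / (precision + recall)


# Hypothetical sentence and tags, for illustration only.
tokens = ["IL-2", "gene", "expression", "in", "T", "cells"]
gold_tags = ["B-DNA", "I-DNA", "O", "O", "B-cell_type", "I-cell_type"]
pred_tags = ["B-DNA", "I-DNA", "O", "O", "O", "B-cell_type"]

gold_spans = bio_to_spans(gold_tags)
pred_spans = bio_to_spans(pred_tags)
print(f"{f1_score(gold_spans, pred_spans):.2f}")  # prints 0.50
```

In this example the predicted DNA span matches the gold span, but the predicted cell type span covers only "cells" rather than "T cells", so under exact matching it counts as an error and both precision and recall are 0.5.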

Methods

Results

Conclusion
