Stacked ensemble combined with fuzzy matching for biomedical named entity recognition of diseases

Balu Bhasuran,Gurusamy Murugesan,Sabenabanu Abdulkadhar,Jeyakumar Natarajan

doi:10.1016/j.jbi.2016.09.009

Balu Bhasuran, Gurusamy Murugesan + Show 2 more

Open Access

https://doi.org/10.1016/j.jbi.2016.09.009

Copy DOI

Journal: Journal of Biomedical Informatics	Publication Date: Sep 12, 2016
Citations: 53	License type: publisher-specific-oa

Affiliation: Bharathiar University

Abstract

Biomedical Named Entity Recognition (Bio-NER) is the crucial initial step in the information extraction process and a majorly focused research area in biomedical text mining. In the past years, several models and methodologies have been proposed for the recognition of semantic types related to gene, protein, chemical, drug and other biological relevant named entities. In this paper, we implemented a stacked ensemble approach combined with fuzzy matching for biomedical named entity recognition of disease names. The underlying concept of stacked generalization is to combine the outputs of base-level classifiers using a second-level meta-classifier in an ensemble. We used Conditional Random Field (CRF) as the underlying classification method that makes use of a diverse set of features, mostly based on domain specific, and are orthographic and morphologically relevant. In addition, we used fuzzy string matching to tag rare disease names from our in-house disease dictionary. For fuzzy matching, we incorporated two best fuzzy search algorithms Rabin Karp and Tuned Boyer Moore. Our proposed approach shows promised result of 94.66%, 89.12%, 84.10%, and 76.71% of F-measure while on evaluating training and testing set of both NCBI disease and BioCreative V CDR Corpora.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Stacked ensemble combined with fuzzy matching for biomedical named entity recognition of diseases

Abstract

Talk to us

Similar Papers

More From: Journal of Biomedical Informatics

Lead the way for us

Similar Papers

D3NER: biomedical named entity recognition using CRF-biLSTM improved with fine-tuned embeddings of various linguistic information.
Thanh Hai Dang ... Hoang-Quynh Le
Bioinformatics | VOL. 34
Thanh Hai Dang, et. al.Thanh Hai Dang ... Hoang-Quynh Le
30 Apr 2018
Bioinformatics | VOL. 34

A comparative study of segment representation for biomedical named entity recognition
H L Shashirekha ... Hamada A Nayel
-
H L Shashirekha, et. al.H L Shashirekha ... Hamada A Nayel
01 Sep 2016
01 Sep 2016

CRFs based parallel biomedical named entity recognition algorithm employing MapReduce framework
Zhuo Tang ... Li Yang
Cluster Computing | VOL. 18
Zhuo Tang, et. al.Zhuo Tang ... Li Yang
22 Jan 2015
Cluster Computing | VOL. 18

Biomedical named entity recognition using deep neural networks with contextual information
Hyejin Cho ... Hyunju Lee
BMC Bioinformatics | VOL. 20
Hyejin Cho, et. al.Hyejin Cho ... Hyunju Lee
01 Dec 2019
BMC Bioinformatics | VOL. 20

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Stacked ensemble combined with fuzzy matching for biomedical named entity recognition of diseases

Abstract

Talk to us

Similar Papers

More From: Journal of Biomedical Informatics