Advanced Feature-Driven Disease Named Entity Recognition Using Conditional Random Fields

Hidayat Rahman, Thomas Hahn, Richard Segall

doi:10.1145/2975167.2985635

Abstract

Named Entity Recognition (NER) in biomedical literature is a very active research area. NER is a crucial component of biomedical text mining because it enables information retrieval, reasoning and knowledge discovery. Much research has been carried out in this area on semantic type categories such as DNA, RNA, proteins and genes. However, disease NER, and human disease NER in particular, has not yet received the attention it needs. Traditional machine learning approaches lack the precision required for disease NER because of their dependence on token-level features, sentence-level features and the integration of features such as orthographic, contextual and linguistic features. This paper proposes a method for disease NER based on Conditional Random Fields (CRF) that uses sentence-level and token-level features and applies it to the NCBI disease corpus. Our system uses a rich feature set that includes orthographic, contextual, affix, bigram, part-of-speech and stem-based features. With these feature sets, our approach achieved a maximum F-score of 94% on the training set under 10-fold cross-validation for semantic labeling of the NCBI disease corpus. On the test and development sets, the model achieved F-scores of 88% and 85%, respectively.
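To make the feature families named in the abstract concrete, the following is a minimal Python sketch of per-token feature extraction of the kind a CRF tagger typically consumes. It is an illustration under stated assumptions, not the authors' implementation: the function names (`word_shape`, `crude_stem`, `token_features`, `sentence_features`), the feature keys, the 3-character affix length, the one-token context window and the toy suffix-stripping stemmer are all hypothetical choices, and part-of-speech tags are assumed to be supplied by an external tagger.

```python
import re


def word_shape(token):
    """Orthographic shape: uppercase -> X, lowercase -> x, digit -> d."""
    shape = re.sub(r"[A-Z]", "X", token)
    shape = re.sub(r"[a-z]", "x", shape)
    return re.sub(r"[0-9]", "d", shape)


def crude_stem(token):
    """Toy suffix stripper; a stand-in for a real stemmer (assumption)."""
    word = token.lower()
    for suffix in ("ies", "es", "s", "ing", "ed"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def token_features(tokens, pos_tags, i):
    """Build one feature dict for the token at position i."""
    token = tokens[i]
    prev_tok = tokens[i - 1] if i > 0 else "<BOS>"
    next_tok = tokens[i + 1] if i < len(tokens) - 1 else "<EOS>"
    return {
        # Orthographic features
        "lower": token.lower(),
        "shape": word_shape(token),
        "is_title": token.istitle(),
        "is_upper": token.isupper(),
        "has_digit": any(c.isdigit() for c in token),
        "has_hyphen": "-" in token,
        # Affix features (length 3 is an arbitrary choice here)
        "prefix3": token[:3].lower(),
        "suffix3": token[-3:].lower(),
        # Stem and part-of-speech features
        "stem": crude_stem(token),
        "pos": pos_tags[i],
        # Contextual features: neighbouring tokens
        "prev_lower": prev_tok.lower(),
        "next_lower": next_tok.lower(),
        # Bigram features: current token joined with each neighbour
        "bigram_prev": prev_tok.lower() + "_" + token.lower(),
        "bigram_next": token.lower() + "_" + next_tok.lower(),
    }


def sentence_features(tokens, pos_tags):
    """Feature dicts for every token in one sentence."""
    return [token_features(tokens, pos_tags, i) for i in range(len(tokens))]


if __name__ == "__main__":
    tokens = ["Patients", "with", "Huntington", "disease"]
    pos_tags = ["NNS", "IN", "NNP", "NN"]  # assumed output of a POS tagger
    for feats in sentence_features(tokens, pos_tags):
        print(feats)
```

A list of such dictionaries per sentence, paired with one label per token (for example a BIO tagging scheme, which is also an assumption here), is the kind of input that common CRF toolkits accept for training.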

Similar Papers

D3NER: biomedical named entity recognition using CRF-biLSTM improved with fine-tuned embeddings of various linguistic information.
Thanh Hai Dang ... Hoang-Quynh Le
Bioinformatics | VOL. 34
30 Apr 2018

Document-level attention-based BiLSTM-CRF incorporating disease dictionary for disease named entity recognition
Kai Xu ... Wenyin Liu
Computers in Biology and Medicine | VOL. 108
07 Apr 2019

Selective Expression For Event Coreference Resolution on Twitter
Wenhan Chao ... Xiao Liu
01 Jul 2019

An Optimal Feature Set for Stylometry-based Style Change detection at Document and Sentence Level
Vivian Oloo ... Lilian D Wanzare
International Journal of Scientific Research in Computer Science, Engineering and Information Technology
15 Nov 2022
