DeIDNER Corpus: Annotation of Clinical Discharge Summary Notes for Named Entity Recognition Using BRAT Tool.

Mahanazuddin Syed,Shorabuddin Syed,Melody L Greer,Meredith Zozus,Shaymaa Al-Shukri,Sudeepa Bhattacharyya,Fred Prior,Kevin Sexton

doi:10.3233/shti210195

Abstract

Named Entity Recognition (NER) aims to identify and classify entities into predefined categories is a critical pre-processing task in Natural Language Processing (NLP) pipeline. Readily available off-the-shelf NER algorithms or programs are trained on a general corpus and often need to be retrained when applied on a different domain. The end model’s performance depends on the quality of named entities generated by these NER models used in the NLP task. To improve NER model accuracy, researchers build domain-specific corpora for both model training and evaluation. However, in the clinical domain, there is a dearth of training data because of privacy reasons, forcing many studies to use NER models that are trained in the non-clinical domain to generate NER feature-set. Thus, influencing the performance of the downstream NLP tasks like information extraction and de-identification. In this paper, our objective is to create a high quality annotated clinical corpus for training NER models that can be easily generalizable and can be used in a downstream de-identification task to generate named entities feature-set.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Studies in health technology and informatics	Publication Date: May 27, 2021
Citations: 8	License type: CC BY-NC 4.0

R Discovery Prime

R Discovery Prime

DeIDNER Corpus: Annotation of Clinical Discharge Summary Notes for Named Entity Recognition Using BRAT Tool.

Abstract

Talk to us

Similar Papers

More From: Studies in health technology and informatics

Lead the way for us

Similar Papers

Automatic Extraction of Comprehensive Drug Safety Information from Adverse Drug Event Narratives in the Korea Adverse Event Reporting System Using Natural Language Processing Techniques.
Siun Kim ... Yesol Hong
Drug Safety | VOL. 46
Siun Kim, et. al.Siun Kim ... Yesol Hong
17 Jun 2023
Drug Safety | VOL. 46

Enhancing the Performance of Telugu Named Entity Recognition Using Gazetteer Features
Saikiranmai Gorla ... Aruna Malapati
Information | VOL. 11
Saikiranmai Gorla, et. al.Saikiranmai Gorla ... Aruna Malapati
02 Feb 2020
Information | VOL. 11

Semi-supervised disentangled framework for transferable named entity recognition
Zhifeng Hao ... Boyan Xu
Neural Networks | VOL. 135
Zhifeng Hao, et. al.Zhifeng Hao ... Boyan Xu
13 Dec 2020
Neural Networks | VOL. 135

Improving Chinese Named Entity Recognition by Large-Scale Syntactic Dependency Graph
Peng Zhu ... Fangzhou Yang
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 30
Peng Zhu, et. al.Peng Zhu ... Fangzhou Yang
01 Jan 2021
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 30

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DeIDNER Corpus: Annotation of Clinical Discharge Summary Notes for Named Entity Recognition Using BRAT Tool.

Abstract

Talk to us

Similar Papers

More From: Studies in health technology and informatics