Iterative development of family history annotation guidelines using a synthetic corpus of clinical text

Taraka Rama,Øystein Nytrø,Lilja Øvrelid,Pål Brekke

doi:10.18653/v1/w18-5613

Taraka Rama, Øystein Nytrø + Show 2 more

Open Access

https://doi.org/10.18653/v1/w18-5613

Copy DOI

Abstract

In this article, we describe the development of annotation guidelines for family history information in Norwegian clinical text. We make use of incrementally developed synthetic clinical text describing patients’ family history relating to cases of cardiac disease and present a general methodology which integrates the synthetically produced clinical statements and guideline development. We analyze inter-annotator agreement based on the developed guidelines and present results from experiments aimed at evaluating the validity and applicability of the annotated corpus using machine learning techniques. The resulting annotated corpus contains 477 sentences and 6030 tokens. Both the annotation guidelines and the annotated corpus are made freely available and as such constitutes the first publicly available resource of Norwegian clinical text.

Highlights

The limited availability of clinical text corpora constitutes a major challenge for the development of clinical NLP tools. Such text originates in the health record (EHR), and access to and use of the EHR is governed by strict data privacy and health service regulations, which usually restricts secondary use and prohibits re-distribution and sharing with the larger NLP community
This article describes the systematic development of annotation guidelines for family history information in Norwegian clinical text
Due to the unavailability of the real health records describing family histories, we developed a methodology for annotation guideline development which makes use of an incrementally developed synthetic corpus

Summary

Introduction

The limited availability of clinical text corpora constitutes a major challenge for the development of clinical NLP tools. Development of annotation guidelines is a time consuming process which in the case of clinical data often requires access to domain experts (clinicians). This article describes the systematic development of annotation guidelines for family history information in Norwegian clinical text. We make use of incrementally developed synthetic clinical text describing patients’ family history relating to cases of cardiac diseases. The domain expert is an integral part of this methodology and generates synthetic examples that challenge the guidelines and further participates both in the annotation and development of guidelines. Ysis (Hiekkalinna et al, 2005)

Previous work

Clinical entities

Span of annotations

Entity detection

Relation extraction

Findings

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Iterative development of family history annotation guidelines using a synthetic corpus of clinical text

Abstract

Highlights

Summary

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jan 1, 2018
Citations: 21	License type: cc-by

Similar Papers

Synthetic data for annotation and extraction of family history information from clinical text
Pål H Brekke ... Øystein Nytrø
Journal of biomedical semantics | VOL. 12
Pål H Brekke, et. al.Pål H Brekke ... Øystein Nytrø
14 Jul 2021
Journal of biomedical semantics | VOL. 12

Building a comprehensive syntactic and semantic corpus of Chinese clinical texts
Bin He ... Chunyan Qu
Journal of Biomedical Informatics | VOL. 69
Bin He, et. al.Bin He ... Chunyan Qu
09 Apr 2017
Journal of Biomedical Informatics | VOL. 69

Regulatory and legal status of clinical guidelines and their role in the quality control of medical care in countries of the European Union, North America and Asia
V K Fedyaeva ... A A Pashkina
FARMAKOEKONOMIKA. Modern Pharmacoeconomic and Pharmacoepidemiology | VOL. 12
V K Fedyaeva, et. al.V K Fedyaeva ... A A Pashkina
28 Oct 2019
FARMAKOEKONOMIKA. Modern Pharmacoeconomic and Pharmacoepidemiology | VOL. 12

Integrating planetary health into clinical guidelines to sustainably transform health care
Alina Herrmann ... Claudia Traidl-Hoffmann
The Lancet Planetary Health | VOL. 6
Alina Herrmann, et. al.Alina Herrmann ... Claudia Traidl-Hoffmann
01 Mar 2022
The Lancet Planetary Health | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Iterative development of family history annotation guidelines using a synthetic corpus of clinical text

Abstract

Highlights

Summary

Talk to us

Similar Papers