Finding relevant free-text radiology reports at scale with IBM Watson Content Analytics: a feasibility study in the UK NHS

Alicja Piotrkowicz,Geoff Hall,Owen Johnson

doi:10.1186/s13326-019-0213-5

Alicja Piotrkowicz, Geoff Hall + Show 1 more

Open Access

https://doi.org/10.1186/s13326-019-0213-5

Copy DOI

Abstract

BackgroundSignificant amounts of health data are stored as free-text within clinical reports, letters, discharge summaries and notes. Busy clinicians have limited time to read such large amounts of free-text and are at risk of information overload and consequently missing information vital to patient care. Automatically identifying relevant information at the point of care has the potential to reduce these risks but represents a considerable research challenge. One software solution that has been proposed in industry is the IBM Watson analytics suite which includes rule-based analytics capable of processing large document collections at scale.ResultsIn this paper we present an overview of IBM Watson Content Analytics and a feasibility study using Content Analytics with a large-scale corpus of clinical free-text reports within a UK National Health Service (NHS) context. We created dictionaries and rules for identifying positive incidence of hydronephrosis and brain metastasis from 5.6 m radiology reports and were able to achieve 94% precision, 95% recall and 89% precision, 94% recall respectively on a sample of manually annotated reports. With minor changes for US English we applied the same rule set to an open access corpus of 0.5 m radiology reports from a US hospital and achieved 93% precision, 94% recall and 84% precision, 88% recall respectively.ConclusionsWe were able to implement IBM Watson within a UK NHS context and demonstrate effective results that could provide clinicians with an automatic safety net which highlights clinically important information within free-text documents. Our results suggest that currently available technologies such as IBM Watson Content Analytics already have the potential to address information overload and improve clinical safety and that solutions developed in one hospital and country may be transportable to different hospitals and countries. Our study was limited to exploring technical aspects of the feasibility of one industry solution and we recognise that healthcare text analytics research is a fast-moving field. That said, we believe our study suggests that text analytics is sufficiently advanced to be implemented within industry solutions that can improve clinical safety.

Highlights

Significant amounts of health data are stored as free-text within clinical reports, letters, discharge summaries and notes
Contributions Our contributions are as follows: (i) we present an implementation of a large-scale commercial text analytics system which uses National Health Service (NHS) data, (ii) we present an overview of IBM Watson Content Analytics and the results of a case study in the radiology domain, and (iii) we show that a task-specific model generalises across radiology reports in two different countries (US and UK) for the two conditions we chose
Results we present the results of our investigation using IBM Watson Content Analytics for processing the free-text clinical reports in our case study

Summary

Introduction

Significant amounts of health data are stored as free-text within clinical reports, letters, discharge summaries and notes. Busy clinicians have limited time to read such large amounts of free-text and are at risk of information overload and missing information vital to patient care. For busy clinicians with limited time the requirement to read large amounts of free-text presents a risk of information overload and missing information vital to the care of their patients. In the past such documents were often handwritten and stored on paper but in parallel with advances in electronic health records (EHR) many of these free text documents are generated digitally and stored within, or linked to, the EHR [1]

Objectives

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Biomedical Semantics	Publication Date: Nov 1, 2019
Citations: 5	License type: open-access

R Discovery Prime

R Discovery Prime

Finding relevant free-text radiology reports at scale with IBM Watson Content Analytics: a feasibility study in the UK NHS

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Biomedical Semantics

Lead the way for us

Similar Papers

Leveraging GPT-4 for Post Hoc Transformation of Free-text Radiology Reports into Structured Reporting: A Multilingual Feasibility Study.
Lisa C Adams ... Stefan M Niehues
Radiology | VOL. 307
Lisa C Adams, et. al.Lisa C Adams ... Stefan M Niehues
04 Apr 2023
Radiology | VOL. 307

Rule-based natural language processing for automation of stroke data extraction: a validation study.
Dane Gunter ... Amy Y X Yu
Neuroradiology | VOL. 64
Dane Gunter, et. al.Dane Gunter ... Amy Y X Yu
01 Aug 2022
Neuroradiology | VOL. 64

The NHS: what are the UK's political parties promising?
Emma Wilkinson
The Lancet | VOL. 385
Emma WilkinsonEmma Wilkinson
01 Mar 2015
The Lancet | VOL. 385

Ensuring that the NHS realises fair financial value from its data
Gianluca Fontana ... Ara Darzi
The Lancet Digital Health | VOL. 2
Gianluca Fontana, et. al.Gianluca Fontana ... Ara Darzi
23 Dec 2019
The Lancet Digital Health | VOL. 2

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Finding relevant free-text radiology reports at scale with IBM Watson Content Analytics: a feasibility study in the UK NHS

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Biomedical Semantics