Large multimodal model-based standardisation of pathology reports with confidence and its prognostic significance.

Ethar Alzaid,Gabriele Pergola,Harriet Evans,David Snead,Fayyaz Minhas

doi:10.1002/2056-4538.70010

Abstract

Despite the existence of established standards and guidelines for pathology reporting, many pathology reports are still written in unstructured free text. Extracting information from these reports and formatting it according to a standard is crucial for consistent interpretation. Automated information extraction from unstructured pathology reports is a challenging task, as it requires accurately interpreting medical terminologies and context-dependent details. In this work, we present a practical approach for automatically extracting information from unstructured pathology reports or scanned paper reports utilising a large multimodal model. This framework uses context-aware prompting strategies to extract values of individual fields, such as grade, size, etc. from pathology reports. A unique feature of the proposed approach is that it assigns a confidence value indicating the correctness of the model's extraction for each field and generates a structured report in line with national pathology guidelines in human and machine-readable formats. We have analysed the extraction performance in terms of accuracy and kappa scores, and the quality of the confidence scores assigned by the model. We have also evaluated the prognostic value of the extracted fields and feature embeddings of the raw text. Results showed that the model can accurately extract information with an accuracy and kappa score up to 0.99 and 0.98, respectively. Our results indicate that confidence scores are an effective indicator of the correctness of the extracted information achieving an area under the receiver operating characteristic curve up to 0.93 thus enabling automatic flagging of extraction errors. Our analysis further reveals that, as expected, information extracted from pathology reports is highly prognostically relevant. The framework demo is available at: https://labieb.dcs.warwick.ac.uk/. Information extracted from pathology reports of colorectal cancer cases in the cancer genome atlas using the proposed approach and its code are available at: https://github.com/EtharZaid/Labieb.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

Large multimodal model-based standardisation of pathology reports with confidence and its prognostic significance.

Abstract

Published Version

Talk to us

Similar Papers

More From: The journal of pathology. Clinical research

Lead the way for us

Journal: The journal of pathology. Clinical research	Publication Date: Nov 1, 2024
License type: CC BY 4.0

Similar Papers

Extraction of Tumor Site from Cancer Pathology Reports using Deep Filters
Abhishek K Dubey ... Georgia Tourassi
-
Abhishek K Dubey, et. al.Abhishek K Dubey ... Georgia Tourassi
04 Sep 2019
04 Sep 2019

Automatic extraction of cancer registry reportable information from free-text pathology reports using multitask convolutional neural networks.
Mohammed Alawad ... J Blair Christian
Journal of the American Medical Informatics Association | VOL. 27
Mohammed Alawad, et. al.Mohammed Alawad ... J Blair Christian
09 Nov 2019
Journal of the American Medical Informatics Association | VOL. 27

Structured Information Extraction of Pathology Reports with Attention-based Graph Convolutional Network
Jialun Wu ... Chen Li
-
Jialun Wu, et. al.Jialun Wu ... Chen Li
16 Dec 2020
16 Dec 2020

"Clinicians Are From Mars and Pathologists Are From Venus" Revisited: Synoptic Reports Improve Clinician Comprehension of Pathology Reports in Inflammatory Bowel Disease.
Stephanie L Yan ... Kurt B Schaberg
Archives of pathology & laboratory medicine | VOL. 148
Stephanie L Yan, et. al.Stephanie L Yan ... Kurt B Schaberg
03 Oct 2023
Archives of pathology & laboratory medicine | VOL. 148

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Large multimodal model-based standardisation of pathology reports with confidence and its prognostic significance.

Abstract

Published Version

Talk to us

Similar Papers

More From: The journal of pathology. Clinical research