Clinical information extraction for lower-resource languages and domains with few-shot learning using pretrained language models and prompting

Phillip Richter-Pechanski,Philipp Wiesenbach,Dominic Mathias Schwab,Christina Kiriakou,Nicolas Geis,Christoph Dieterich,Anette Frank

doi:10.1017/nlp.2024.52

Abstract

Abstract A vast amount of clinical data are still stored in unstructured text. Automatic extraction of medical information from these data poses several challenges: high costs of clinical expertise, restricted computational resources, strict privacy regulations, and limited interpretability of model predictions. Recent domain adaptation and prompting methods using lightweight masked language models showed promising results with minimal training data and allow for application of well-established interpretability methods. We are first to present a systematic evaluation of advanced domain-adaptation and prompting methods in a lower-resource medical domain task, performing multi-class section classification on German doctor’s letters. We evaluate a variety of models, model sizes (further-pre)training and task settings, and conduct extensive class-wise evaluations supported by Shapley values to validate the quality of small-scale training data and to ensure interpretability of model predictions. We show that in few-shot learning scenarios, a lightweight, domain-adapted pretrained language model, prompted with just 20 shots per section class, outperforms a traditional classification model, by increasing accuracy from $48.6\%$ to $79.1\%$ . By using Shapley values for model selection and training data optimization, we could further increase accuracy up to $84.3\%$ . Our analyses reveal that pretraining of masked language models on general-language data is important to support successful domain-transfer to medical language, so that further-pretraining of general-language models on domain-specific documents can outperform models pretrained on domain-specific data only. Our evaluations show that applying prompting based on general-language pretrained masked language models combined with further-pretraining on medical-domain data achieves significant improvements in accuracy beyond traditional models with minimal training data. Further performance improvements and interpretability of results can be achieved, using interpretability methods such as Shapley values. Our findings highlight the feasibility of deploying powerful machine learning methods in clinical settings and can serve as a process-oriented guideline for lower-resource languages and domains such as clinical information extraction projects.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Clinical information extraction for lower-resource languages and domains with few-shot learning using pretrained language models and prompting

Abstract

Talk to us

Similar Papers

More From: Natural Language Processing

Lead the way for us

Journal: Natural Language Processing	Publication Date: Oct 31, 2024
License type: CC BY 4.0

Similar Papers

Neural Transfer Learning For Vietnamese Sentiment Analysis Using Pre-trained Contextual Language Models
An Pha Le ... Tran Vu Pham
-
An Pha Le, et. al.An Pha Le ... Tran Vu Pham
16 Dec 2021
16 Dec 2021

Better Few-Shot Text Classification with Pre-trained Language Model
Zheng Chen ... Yunchen Zhang
-
Zheng Chen, et. al.Zheng Chen ... Yunchen Zhang
01 Jan 2020
01 Jan 2020

Towards an Enhanced Understanding of Bias in Pre-trained Neural Language Models: A Survey with Special Emphasis on Affective Bias
Anoop K ... Lajish V L
-
Anoop K, et. al. Anoop K ... Lajish V L
01 Jan 2021
01 Jan 2021

Surgicberta: a pre-trained language model for procedural surgical language
Marco Bombieri ... Simone Paolo Ponzetto
International Journal of Data Science and Analytics | VOL. 18
Marco Bombieri, et. al.Marco Bombieri ... Simone Paolo Ponzetto
16 Aug 2023
International Journal of Data Science and Analytics | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Clinical information extraction for lower-resource languages and domains with few-shot learning using pretrained language models and prompting

Abstract

Talk to us

Similar Papers

More From: Natural Language Processing