A comparison of few-shot and traditional named entity recognition models for medical text.

Yao Ge,Mohammed Ali Al-Garadi,Abeed Sarker,Yuan-Chi Yang,Yuting Guo

doi:10.1109/ichi54592.2022.00024

Abstract

Many research problems involving medical texts have limited amounts of annotated data available (e.g., expressions of rare diseases). Traditional supervised machine learning algorithms, particularly those based on deep neural networks, require large volumes of annotated data, and they underperform when only small amounts of labeled data are available. Few-shot learning (FSL) is a category of machine learning models that are designed with the intent of solving problems that have small annotated datasets available. However, there is no current study that compares the performances of FSL models with traditional models (e.g., conditional random fields) for medical text at different training set sizes. In this paper, we attempted to fill this gap in research by comparing multiple FSL models with traditional models for the task of named entity recognition (NER) from medical texts. Using five health-related annotated NER datasets, we benchmarked three traditional NER models based on BERT-BERT-Linear Classifier (BLC), BERT-CRF (BC) and SANER; and three FSL NER models-StructShot & NNShot, Few-Shot Slot Tagging (FS-ST) and ProtoNER. Our benchmarking results show that almost all models, whether traditional or FSL, achieve significantly lower performances compared to the state-of-the-art with small amounts of training data. For the NER experiments we executed, the F1-scores were very low with small training sets, typically below 30%. FSL models that were reported to perform well on non-medical texts significantly underperformed, compared to their reported best, on medical texts. Our experiments also suggest that FSL methods tend to perform worse on data sets from noisy sources of medical texts, such as social media (which includes misspellings and colloquial expressions), compared to less noisy sources such as medical literature. Our experiments demonstrate that the current state-of-the-art FSL systems are not yet suitable for effective NER in medical natural language processing tasks, and further research needs to be carried out to improve their performances. Creation of specialized, standardized datasets replicating real-world scenarios may help to move this category of methods forward.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A comparison of few-shot and traditional named entity recognition models for medical text.

Abstract

Talk to us

Similar Papers

More From: IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics

Lead the way for us

Journal: IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics	Publication Date: Jun 1, 2022
Citations: 1

Similar Papers

A hybrid few-shot multiple-instance learning model predicting the aggressiveness of lymphoma in PET/CT images
Caiwen Xu ... Guojun Zhang
Computer methods and programs in biomedicine | VOL. 243
Caiwen Xu, et. al.Caiwen Xu ... Guojun Zhang
17 Oct 2023
Computer methods and programs in biomedicine | VOL. 243

Auto Parts Defect Detection Based on Few-shot Learning
Jiancheng Xu ... Jialei Ma
-
Jiancheng Xu, et. al.Jiancheng Xu ... Jialei Ma
20 May 2022
20 May 2022

Few-shot Self-optimization Learning Based on Deep Metric
Yong Ma ... Quansheng Dou
-
Yong Ma, et. al.Yong Ma ... Quansheng Dou
14 Dec 2020
14 Dec 2020

Named Entity Recognition of Medical Text Based on the Deep Neural Network.
Tianjiao Yang ... Alireza Souri
Journal of Healthcare Engineering | VOL. 2022
Tianjiao Yang, et. al.Tianjiao Yang ... Alireza Souri
07 Mar 2022
Journal of Healthcare Engineering | VOL. 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A comparison of few-shot and traditional named entity recognition models for medical text.

Abstract

Talk to us

Similar Papers

More From: IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics