Automated labelling of radiology reports using natural language processing: Comparison of traditional and newer methods

Seo Yi Chng,Lionel T E Cheng,Paul J W Tern,Matthew R X Kan

doi:10.1002/hcs2.40

Abstract

AbstractAutomated labelling of radiology reports using natural language processing allows for the labelling of ground truth for large datasets of radiological studies that are required for training of computer vision models. This paper explains the necessary data preprocessing steps, reviews the main methods for automated labelling and compares their performance. There are four main methods of automated labelling, namely: (1) rules‐based text‐matching algorithms, (2) conventional machine learning models, (3) neural network models and (4) Bidirectional Encoder Representations from Transformers (BERT) models. Rules‐based labellers perform a brute force search against manually curated keywords and are able to achieve high F1 scores. However, they require proper handling of negative words. Machine learning models require preprocessing that involves tokenization and vectorization of text into numerical vectors. Multilabel classification approaches are required in labelling radiology reports and conventional models can achieve good performance if they have large enough training sets. Deep learning models make use of connected neural networks, often a long short‐term memory network, and are similarly able to achieve good performance if trained on a large data set. BERT is a transformer‐based model that utilizes attention. Pretrained BERT models only require fine‐tuning with small data sets. In particular, domain‐specific BERT models can achieve superior performance compared with the other methods for automated labelling.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Health Care Science	Publication Date: Apr 1, 2023
Citations: 4	License type: CC BY-NC 4.0

R Discovery Prime

R Discovery Prime

Automated labelling of radiology reports using natural language processing: Comparison of traditional and newer methods

Abstract

Talk to us

Similar Papers

More From: Health Care Science

Lead the way for us

Similar Papers

Bert model fine-tuning for text classification in knee OA radiology reports
L Chen ... V Pedoia
Osteoarthritis and Cartilage | VOL. 28
L Chen, et. al.L Chen ... V Pedoia
01 Apr 2020
Osteoarthritis and Cartilage | VOL. 28

Engineering Document Summarization Using Sentence Representations Generated by Bidirectional Language Model
Yunjian Qiu ... Yan Jin
-
Yunjian Qiu, et. al.Yunjian Qiu ... Yan Jin
17 Aug 2021
17 Aug 2021

Oversampling effect in pretraining for bidirectional encoder representations from transformers (BERT) to localize medical BERT and enhance biomedical BERT
Shoya Wada ... Yasushi Matsumura
Artificial Intelligence In Medicine | VOL. 153
Shoya Wada, et. al.Shoya Wada ... Yasushi Matsumura
05 May 2024
Artificial Intelligence In Medicine | VOL. 153

An Analysis of BERT (NLP) for Assisted Subject Indexing for Project Gutenberg
Charlene Chou ... Tony Chu
Cataloging & Classification Quarterly | VOL. 60
Charlene Chou, et. al.Charlene Chou ... Tony Chu
21 Oct 2022
Cataloging & Classification Quarterly | VOL. 60

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Automated labelling of radiology reports using natural language processing: Comparison of traditional and newer methods

Abstract

Talk to us

Similar Papers

More From: Health Care Science