Abstract

The project’s goal is to extract tumour measurement data from oncological radiology reports with equal, or improved accuracy of a human radiologist. The purpose is to streamline and improve the efficiency of cancer diagnosis, accomplishing this through methods in artificial intelligence. In this experiment, a collection of 85,218 colorectal, and lung radiology reports were used. After loading reports into a data frame, and a BioBERT (bidirectional encoder representations from transformers pre-trained on biomedical corpora) model into a virtual environment, a question is set to be answered by the model where the context of the question is each radiology report’s findings section of the specified organ. These inputs are tokenized and embedded into numerical values to map sentences to vectors of real numbers. Vectors are fed into the model, and an answer is output in natural language for human readability. Answers are stored in the data frame in their corresponding row from which they were derived. The model successfully answered questions about measurements of tumours written in free text in reports where tumours were present, and successfully ignored or did not report in cases where tumours were not present, or measurements were unchanged. In cases with multiple tumours, the model reported exclusively first listed measurements. In an updated version of this model, context will be run through sentence-wise to ensure equal attention to the context entirely. This project is evidence that using one question and selecting the findings portion of a radiology report for one organ as context in a question-answering model built using BioBERT is effective, and efficient in collecting measurements from radiology reports. This algorithm can be applied to other areas of medicine, or other fields entirely with a few model alterations. This project is a step forward in improving cancer diagnosis efficiency and improving medical AI.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call