Detection of medical text semantic similarity based on convolutional neural network

Tao Zheng,Handong Ma,Yimei Gao,Ya Zhang,Mei Li,Xingzhi Fu,Shaodian Zhang,Chenhao Fan,Fei Wang

doi:10.1186/s12911-019-0880-2

Abstract

Background: Imaging examinations, such as ultrasonography, magnetic resonance imaging and computed tomography scans, play key roles in healthcare settings. To assess and improve the quality of imaging diagnosis, we need to manually find and compare the pre-existing imaging and pathology examination reports that cover overlapping body sites in electronic medical records (EMRs). Retrieving those reports is time-consuming. In this paper, we propose a convolutional neural network (CNN) based method that makes better use of the semantic information in report texts to accelerate the retrieval process.

Methods: We included 16,354 imaging and pathology report-pairs from 1926 patients who were admitted to Shanghai Tongren Hospital and had ultrasonic examinations between 1st May 2017 and 31st July 2017. We adapted the CNN model to calculate the similarities among the report-pairs in order to identify target report-pairs with overlapping body sites, and compared its performance with six conventional models: keyword mapping, latent semantic analysis (LSA), latent Dirichlet allocation (LDA), Doc2Vec, Siamese long short-term memory (LSTM) and a model based on named entity recognition (NER). We also used a graph embedding method to enhance the word representation by capturing semantic relation information from medical ontologies. Additionally, we used the LIME algorithm to identify which features (or words) are decisive for the prediction results, improving model interpretability.

Results: Experimental results showed that our CNN model achieved a significant improvement over all the conventional models on area under the receiver operating characteristic curve (AUROC), precision, recall and F1-score on our test dataset. The AUROC of our CNN models improved by approximately 3–7%. The AUROC of the CNN model with graph-embedding and ontology-based medical concept vectors was 0.8% higher than that of the model with randomly initialized vectors and 1.5% higher than that of the model with pre-trained word vectors.

Conclusion: Our study demonstrates that a CNN model with pre-trained medical concept vectors can accurately identify target report-pairs with overlapping body sites and potentially accelerate the retrieval process for imaging diagnosis quality measurement.
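
As a rough illustration of the AUROC metric reported in the abstract: once a model assigns a similarity score to each report-pair, AUROC is the probability that a randomly chosen target pair (overlapping body sites) scores higher than a randomly chosen non-target pair. The sketch below computes it by direct pairwise comparison. The function name `auroc` and the toy labels and scores are illustrative assumptions, not code or data from the study.

```python
from typing import Sequence


def auroc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUROC as the fraction of (positive, negative) pairs ranked correctly.

    labels: 1 for a target report-pair, 0 otherwise (hypothetical encoding).
    scores: model similarity scores, higher meaning "more likely a target".
    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative pair")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Made-up values for illustration only; not results from the paper.
toy_labels = [1, 0, 1, 0, 1]
toy_scores = [0.9, 0.2, 0.6, 0.7, 0.8]
print(f"toy AUROC = {auroc(toy_labels, toy_scores):.3f}")
```

The pairwise loop is quadratic in the number of pairs, which is fine for a sketch; a rank-based formulation gives the same value more efficiently on large test sets.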

Highlights

Imaging examinations, such as ultrasonography, magnetic resonance imaging and computed tomography scans, play key roles in healthcare settings
The area under the receiver operating characteristic (ROC) curve (AUC) of our convolutional neural network (CNN) models, with either randomly initialized vectors or pre-trained word vectors, was superior to that of every baseline model, an improvement of approximately 3–7%
We ran a t-test on the AUC results from 50 independent runs of the CNN with and without pre-trained medical concept vectors; the p-value was smaller than 0.001, which suggests the improvement is significant
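
The comparison of AUC values across repeated runs can be sketched as a two-sample t-test. The source does not say which t-test variant was used, so the example below assumes Welch's unequal-variance form and returns only the t statistic and degrees of freedom. The function name `welch_t` and the sample values are hypothetical, not the study's data.

```python
import math
import statistics
from typing import Sequence, Tuple


def welch_t(a: Sequence[float], b: Sequence[float]) -> Tuple[float, float]:
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom.

    a, b: per-run metric values (e.g. AUC) for two model variants.
    Each sample needs at least two values and non-zero variance overall.
    """
    # Squared standard error of each sample mean.
    va = statistics.variance(a) / len(a)
    vb = statistics.variance(b) / len(b)

    t = (statistics.mean(a) - statistics.mean(b)) / math.sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df


# Made-up AUC values for illustration only; the study used 50 runs per variant.
with_concept_vectors = [0.925, 0.931, 0.928, 0.934, 0.927]
without_concept_vectors = [0.915, 0.921, 0.918, 0.919, 0.917]

t_stat, dof = welch_t(with_concept_vectors, without_concept_vectors)
print(f"t = {t_stat:.3f}, df = {dof:.1f}")
```

Turning the statistic into a p-value requires the Student t distribution, which the Python standard library does not provide; a statistics package would normally be used for that step.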

Summary

Introduction

Imaging examinations, such as ultrasonography, magnetic resonance imaging and computed tomography scans, play key roles in healthcare settings. To assess and improve the quality of imaging diagnosis, we need to manually find and compare the pre-existing imaging and pathology examination reports that cover overlapping body sites in electronic medical records (EMRs). There can be discrepancies in such complicated and heterogeneous information (e.g., the diagnosis in a patient's radiology report differs from the condition the patient actually has), which may lead to imprecise clinical decisions [1]. Although such discrepancies may be inevitable given the complexity of imaging diagnosis, quality measurement and improvement are still needed to minimize avoidable errors via a manual verification process. Only a few patients who receive an imaging examination of a given body site also undergo surgery or a pathologic biopsy at the same site. To find these patients, quality control staff regularly and manually review EMRs and scan the related examination reports, which is inefficient and time-consuming. We propose a machine learning based approach to retrieve these patients from EMRs more efficiently.

Journal: BMC Medical Informatics and Decision Making
Publication Date: Aug 7, 2019
Citations: 21
License type: open-access
