Deep learning uncertainty quantification for clinical text classification

Alina Peluso, Stephen Schwartz, Charles Wiggins, Ioana Danciu, Jennifer Doherty, Jamaludin Mohd Yusof, Lynne Penberthy, Antoinette Stroup, Eric B Durbin, Noah Schaefferkoetter, Tanmoy Bhattacharya, Adam Spannaus, Xiao-Cheng Wu, Georgia D Tourassi, Shang Gao, Linda Coyle, Hong-Jun Yoon

doi:10.1016/j.jbi.2023.104576

Abstract

Introduction: Machine learning algorithms are expected to work side-by-side with humans in decision-making pipelines. Thus, the ability of classifiers to make reliable decisions is of paramount importance. Deep neural networks (DNNs) represent the state-of-the-art models to address real-world classification. Although the strength of activation in DNNs is often correlated with the network’s confidence, in-depth analyses are needed to establish whether these networks are well calibrated.

Method: In this paper, we demonstrate the use of DNN-based classification tools to benefit cancer registries by automating information extraction of disease at diagnosis and at surgery from electronic text pathology reports from the US National Cancer Institute (NCI) Surveillance, Epidemiology, and End Results (SEER) population-based cancer registries. In particular, we introduce multiple methods for selective classification to achieve a target level of accuracy on multiple classification tasks while minimizing the rejection amount, that is, the number of electronic pathology reports for which the model’s predictions are unreliable. We evaluate the proposed methods by comparing our approach with the current in-house deep learning-based abstaining classifier.

Results: Overall, all the proposed selective classification methods achieve the targeted level of accuracy or higher in a trade-off analysis aimed at minimizing the rejection rate. On in-distribution validation and holdout test data, all the proposed methods achieve the required target level of accuracy on all tasks with a lower rejection rate than the deep abstaining classifier (DAC). Interpreting the results for the out-of-distribution test data is more complex; nevertheless, in this case as well, the rejection rate from the best among the proposed methods achieving 97% accuracy or higher is lower than the rejection rate based on the DAC.

Conclusions: We show that although both approaches can flag those samples that should be manually reviewed and labeled by human annotators, the newly proposed methods retain a larger fraction and do so without retraining, thus offering a reduced computational cost compared with the in-house deep learning-based abstaining classifier.
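
The abstract does not spell out the individual selective classification methods, so the following is only a generic sketch of the trade-off it describes: on held-out data, choose the confidence threshold that meets a target accuracy on the retained predictions while rejecting as few reports as possible. It assumes each prediction comes with a single confidence score (for example, the maximum softmax output) and a known correctness flag. The function name `pick_threshold` and the toy numbers are hypothetical illustrations, not the authors' method or data.

```python
from typing import List, Optional, Tuple


def pick_threshold(
    confidences: List[float],
    correct: List[bool],
    target_accuracy: float,
) -> Optional[Tuple[float, float]]:
    """Return (threshold, rejection_rate) for the lowest threshold whose
    retained predictions reach target_accuracy, or None if none does.

    A prediction is retained when its confidence is >= threshold.
    """
    n = len(confidences)
    # Candidate thresholds in ascending order: the number of retained
    # predictions can only shrink as the threshold grows, so the first
    # candidate that meets the target has the lowest rejection rate.
    for threshold in sorted(set(confidences)):
        kept = [ok for conf, ok in zip(confidences, correct) if conf >= threshold]
        accuracy = sum(kept) / len(kept)  # kept is never empty here
        if accuracy >= target_accuracy:
            return threshold, 1 - len(kept) / n
    return None


# Toy validation set (made-up values, for illustration only).
val_conf = [0.99, 0.95, 0.90, 0.80, 0.70, 0.60, 0.55, 0.50]
val_ok = [True, True, True, True, False, True, False, False]

result = pick_threshold(val_conf, val_ok, target_accuracy=0.8)
print(result)
```

In practice the threshold would be fixed on validation data and then applied unchanged to holdout or out-of-distribution reports, with everything below it routed to human review. A post hoc rule of this kind needs no retraining, which is the property the conclusions highlight.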

Journal: Journal of Biomedical Informatics
Publication Date: Dec 13, 2023
Citations: 1
License type: CC BY-NC-ND
