A real-world demonstration of machine learning generalizability in the detection of intracranial hemorrhage on head computerized tomography

Hojjat Salehinejad,Suradech Suthiphosuwan,Aditya Bharatha,Noah Ditkofsky,Hui-Ming Lin,Jumpei Kitamura,Jefferson R Wilson,Amy Lin,Muhammad Mamdani,Errol Colak

doi:10.1038/s41598-021-95533-2

Hojjat Salehinejad, Suradech Suthiphosuwan + Show 8 more

Open Access

https://doi.org/10.1038/s41598-021-95533-2

Copy DOI

Abstract

Machine learning (ML) holds great promise in transforming healthcare. While published studies have shown the utility of ML models in interpreting medical imaging examinations, these are often evaluated under laboratory settings. The importance of real world evaluation is best illustrated by case studies that have documented successes and failures in the translation of these models into clinical environments. A key prerequisite for the clinical adoption of these technologies is demonstrating generalizable ML model performance under real world circumstances. The purpose of this study was to demonstrate that ML model generalizability is achievable in medical imaging with the detection of intracranial hemorrhage (ICH) on non-contrast computed tomography (CT) scans serving as the use case. An ML model was trained using 21,784 scans from the RSNA Intracranial Hemorrhage CT dataset while generalizability was evaluated using an external validation dataset obtained from our busy trauma and neurosurgical center. This real world external validation dataset consisted of every unenhanced head CT scan (n = 5965) performed in our emergency department in 2019 without exclusion. The model demonstrated an AUC of 98.4%, sensitivity of 98.8%, and specificity of 98.0%, on the test dataset. On external validation, the model demonstrated an AUC of 95.4%, sensitivity of 91.3%, and specificity of 94.1%. Evaluating the ML model using a real world external validation dataset that is temporally and geographically distinct from the training dataset indicates that ML generalizability is achievable in medical imaging applications.

Highlights

Machine learning (ML) holds great promise in transforming healthcare
Evaluation of ML model performance on the test dataset revealed an AUC of 98.4%, a balanced accuracy of 98.4%, an imbalanced accuracy of 98.3%, sensitivity of 98.8%, specificity of 98.0%, positive predictive value of 96.5% and negative predictive value of 99.3% for intracranial hemorrhage (ICH) detection (Table 2)
Receiver operating characteristic (ROC) curves were created using the external dataset by generated probabilities per ICH type (Fig. 4a) and generated decisions after applying thresholds (Fig. 4b)

Summary

Introduction

Machine learning (ML) holds great promise in transforming healthcare. While published studies have shown the utility of ML models in interpreting medical imaging examinations, these are often evaluated under laboratory settings. The purpose of this study was to demonstrate that ML model generalizability is achievable in medical imaging with the detection of intracranial hemorrhage (ICH) on non-contrast computed tomography (CT) scans serving as the use case. The importance of real world evaluation is best demonstrated by case studies which have shown failures in translation from laboratory to clinical settings due to a variety of sociotechnical factors[16] Another limitation of many of these studies is a common source of the training, validation, and test datasets. We developed an ML model for ICH detection in non-contrast CTs of the head and examined generalization performance in the real world setting of a major neurosurgical and trauma center. This is the first study to both develop and assess generalization performance of an ML model for ICH detection

Objectives

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Reports	Publication Date: Aug 23, 2021
Citations: 36	License type: open-access

R Discovery Prime

R Discovery Prime

A real-world demonstration of machine learning generalizability in the detection of intracranial hemorrhage on head computerized tomography

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

Machine Learning Models for Blood Glucose Level Prediction in Patients With Diabetes Mellitus: Systematic Review and Network Meta-Analysis.
Kui Liu ... Zhenhua Liu
JMIR Medical Informatics | VOL. 11
Kui Liu, et. al.Kui Liu ... Zhenhua Liu
20 Nov 2023
JMIR Medical Informatics | VOL. 11

Does Artificial Intelligence Outperform Natural Intelligence in Interpreting Musculoskeletal Radiological Studies? A Systematic Review.
Olivier Q Groot ... Jos A M Bramer
Clinical Orthopaedics & Related Research | VOL. 478
Olivier Q Groot, et. al.Olivier Q Groot ... Jos A M Bramer
30 Jul 2020
Clinical Orthopaedics & Related Research | VOL. 478

Application of machine learning for lung cancer survival prognostication-A systematic review and meta-analysis.
Alexander J Didier ... Mohamed A Omballi
Frontiers in Artificial Intelligence | VOL. 7
Alexander J Didier, et. al.Alexander J Didier ... Mohamed A Omballi
05 Apr 2024
Frontiers in Artificial Intelligence | VOL. 7

Evaluating external generalizability of machine learning models for recycled aggregate concrete property prediction
Shreyas Pandurang Jadhav ... Nikhil Bugalia
Journal of Cleaner Production | VOL. 469
Shreyas Pandurang Jadhav, et. al.Shreyas Pandurang Jadhav ... Nikhil Bugalia
15 Jul 2024
Journal of Cleaner Production | VOL. 469

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A real-world demonstration of machine learning generalizability in the detection of intracranial hemorrhage on head computerized tomography

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports