Abstract
Machine learning models for medical image analysis often suffer from poor performance on important subsets of a population that are not identified during training or testing. For example, overall performance of a cancer detection model may be high, but the model may still consistently miss a rare but aggressive cancer subtype. We refer to this problem as hidden stratification, and observe that it results from incompletely describing the meaningful variation in a dataset. While hidden stratification can substantially reduce the clinical efficacy of machine learning models, its effects remain difficult to measure. In this work, we assess the utility of several possible techniques for measuring hidden stratification effects, and characterize these effects via both synthetic experiments on the CIFAR-100 benchmark dataset and evaluation on multiple real-world medical imaging datasets. Using these measurement techniques, we find evidence that hidden stratification can occur in unidentified imaging subsets with low prevalence, low label quality, subtle distinguishing features, or spurious correlates, and that it can result in relative performance differences of over 20% on clinically important subsets. Finally, we discuss the clinical implications of our findings, and suggest that evaluation of hidden stratification should be a critical component of any machine learning deployment in medical imaging.
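To make the subset-level measurement concrete, the following is a minimal sketch of a subgroup performance audit of the kind the abstract describes: given predictions and subgroup annotations, it reports per-subgroup recall and its relative gap from overall recall. The function name, the toy data, and the choice of recall as the metric are illustrative assumptions, not the paper's actual implementation.

```python
import numpy as np
from sklearn.metrics import recall_score

def subgroup_recall_gaps(y_true, y_pred, subgroup):
    """Per-subgroup recall and relative gap from overall recall.

    y_true, y_pred: binary arrays of ground-truth and predicted labels.
    subgroup: array of subgroup identifiers (e.g., cancer subtype),
              one per example.
    """
    overall = recall_score(y_true, y_pred)
    gaps = {}
    for g in np.unique(subgroup):
        mask = subgroup == g
        if y_true[mask].sum() == 0:
            continue  # no positives in this subgroup; recall is undefined
        r = recall_score(y_true[mask], y_pred[mask])
        # Relative gap: fraction of overall recall lost on this subgroup.
        gaps[g] = {"recall": r, "relative_gap": (overall - r) / overall}
    return overall, gaps

# Hypothetical example: a rare subtype "b" whose positives are missed.
y_true   = np.array([1, 1, 1, 1, 1, 1, 0, 0, 0, 0])
y_pred   = np.array([1, 1, 1, 1, 0, 0, 0, 0, 0, 1])
subgroup = np.array(["a", "a", "a", "a", "b", "b", "a", "a", "b", "b"])
overall, gaps = subgroup_recall_gaps(y_true, y_pred, subgroup)
print(overall, gaps)  # subtype "b" shows a large relative gap
```

In this toy example the overall recall looks acceptable (0.67), yet recall on the rare subgroup "b" is zero, illustrating how aggregate metrics can mask a clinically important stratum when subgroup labels are available to audit against.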