Identifying Biases in a Multicenter MRI Database for Parkinson's Disease Classification: Is the Disease Classifier a Secret Site Classifier?

Raissa Souza,Matthias Wilms,Nils D Forkert,Emma A M Stanley,Anthony Winder,Milton Camacho,Vibujithan Vigneshwaran,Oury Monchi,Richard Camicioli

doi:10.1109/jbhi.2024.3352513

Abstract

Sharing multicenter imaging datasets can be advantageous to increase data diversity and size but may lead to spurious correlations between site-related biological and non-biological image features and target labels, which machine learning (ML) models may exploit as shortcuts. To date, studies analyzing how and if deep learning models may use such effects as a shortcut are scarce. Thus, the aim of this work was to investigate if site-related effects are encoded in the feature space of an established deep learning model designed for Parkinson's disease (PD) classification based on T1-weighted MRI datasets. Therefore, all layers of the PD classifier were frozen, except for the last layer of the network, which was replaced by a linear layer that was exclusively re-trained to predict three potential bias types (biological sex, scanner type, and originating site). Our findings based on a large database consisting of 1880 MRI scans collected across 41 centers show that the feature space of the established PD model (74% accuracy) can be used to classify sex (75% accuracy), scanner type (79% accuracy), and site location (71% accuracy) with high accuracies despite this information never being explicitly provided to the PD model during original training. Overall, the results of this study suggest that trained image-based classifiers may use unwanted shortcuts that are not meaningful for the actual clinical task at hand. This finding may explain why many image-based deep learning models do not perform well when applied to data from centers not contributing to the training set.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Journal of Biomedical and Health Informatics	Publication Date: Apr 1, 2024
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Identifying Biases in a Multicenter MRI Database for Parkinson's Disease Classification: Is the Disease Classifier a Secret Site Classifier?

Abstract

Talk to us

Similar Papers

More From: IEEE Journal of Biomedical and Health Informatics

Lead the way for us

Similar Papers

Exploring the Potential of Deep Learning in the Classification and Early Detection of Parkinson's Disease
V S Bakkialakshmi ... Hritwik Ghosh
EAI Endorsed Transactions on Pervasive Health and Technology | VOL. 10
V S Bakkialakshmi, et. al.V S Bakkialakshmi ... Hritwik Ghosh
27 Mar 2024
EAI Endorsed Transactions on Pervasive Health and Technology | VOL. 10

CCZO Residual GhostNet
Arogia Victor Paul M ... Sharmila Shankar
International journal of electrical and computer engineering systems | VOL. 15
Arogia Victor Paul M, et. al.Arogia Victor Paul M ... Sharmila Shankar
01 Jan 2024
International journal of electrical and computer engineering systems | VOL. 15

Explainable hypergraphs for gait based Parkinson classification
Anirban Dutta Choudhury ... Ananda S Chowdhury
Pattern Recognition Letters | VOL. 186
Anirban Dutta Choudhury, et. al.Anirban Dutta Choudhury ... Ananda S Chowdhury
01 Oct 2024
Pattern Recognition Letters | VOL. 186

Unsupervised Pre-trained Models from Healthy ADLs Improve Parkinson's Disease Classification of Gait Patterns.
Anirudh Som ... Matthew Buman
Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference | VOL. 2020
Anirudh Som, et. al.Anirudh Som ... Matthew Buman
01 Jul 2020
01 Jul 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Identifying Biases in a Multicenter MRI Database for Parkinson's Disease Classification: Is the Disease Classifier a Secret Site Classifier?

Abstract

Talk to us

Similar Papers

More From: IEEE Journal of Biomedical and Health Informatics