Identifying Biases in a Multicenter MRI Database for Parkinson's Disease Classification: Is the Disease Classifier a Secret Site Classifier?

  • Abstract
  • Highlights & Summary
  • PDF
  • References
  • Citations
  • Similar Papers
Abstract
Translate article icon Translate Article Star icon
Take notes icon Take Notes

Sharing multicenter imaging datasets can be advantageous to increase data diversity and size but may lead to spurious correlations between site-related biological and non-biological image features and target labels, which machine learning (ML) models may exploit as shortcuts. To date, studies analyzing how and if deep learning models may use such effects as a shortcut are scarce. Thus, the aim of this work was to investigate if site-related effects are encoded in the feature space of an established deep learning model designed for Parkinson's disease (PD) classification based on T1-weighted MRI datasets. Therefore, all layers of the PD classifier were frozen, except for the last layer of the network, which was replaced by a linear layer that was exclusively re-trained to predict three potential bias types (biological sex, scanner type, and originating site). Our findings based on a large database consisting of 1880 MRI scans collected across 41 centers show that the feature space of the established PD model (74% accuracy) can be used to classify sex (75% accuracy), scanner type (79% accuracy), and site location (71% accuracy) with high accuracies despite this information never being explicitly provided to the PD model during original training. Overall, the results of this study suggest that trained image-based classifiers may use unwanted shortcuts that are not meaningful for the actual clinical task at hand. This finding may explain why many image-based deep learning models do not perform well when applied to data from centers not contributing to the training set.

Highlights

  • S HARING multicenter medical imaging data is assumed to lead to several benefits, such as increasing interdisciplinary collaboration, avoiding duplication of clinical trials and data collection, supporting novel medical insights, and training more robust machine learning (ML) models for computer-aided diagnosis [1]

  • The results of this work suggest that the feature space of the Parkinson’s disease (PD) classifier encodes information from the biases investigated: sex, site, and scanner, even with this simple retraining setup of only replacing and retraining the final layer of the PD model with a single dense layer ’head’ while all other layers remained frozen

  • Even though the database as a whole was well-balanced (867 PD and 1013 healthy subjects (HS)) and the F1-score (73%), which considers the proportion of PD and HS in the testing set, was similar compared to the accuracy (74%) for PD vs. HS classification, the model achieved a considerably higher sensitivity (85%) than specificity (64% - Fig. 2 A)

Read more Highlights Expand/Collapse icon

Summary

IntroductionExpand/Collapse icon

S HARING multicenter medical imaging data is assumed to lead to several benefits, such as increasing interdisciplinary collaboration, avoiding duplication of clinical trials and data collection (e.g., healthy control cohort), supporting novel medical insights, and training more robust machine learning (ML) models for computer-aided diagnosis [1]. Compiling data from multiple acquisition sites can improve the diversity and size of the database as a whole, which is assumed to be beneficial for training and testing of ML models, especially for medical image analysis where data is often rare at single sites [3]–[6]. The utilization of large multicenter datasets is expected to enhance the performance of trained ML models, enable and improve the identification of subtle disease expressions in the data, and increase their generalizability to unseen data. A lack of intra-site data variation may result in spurious correlations between ’siterelated effects‘ (image features) and the target label, which ML models could exploit. For MRI-based neuroimaging data, these site-related effects or biases could originate from complex factors such as biological differences

ObjectivesExpand/Collapse icon
MethodsExpand/Collapse icon
ResultsExpand/Collapse icon
DiscussionExpand/Collapse icon
ConclusionExpand/Collapse icon
ReferencesShowing 10 of 29 papers
  • Cite Count Icon 22
  • 10.1016/j.media.2022.102610
Predicting treatment-specific lesion outcomes in acute ischemic stroke from 4D CT perfusion imaging using spatio-temporal convolutional neural networks.
  • Nov 1, 2022
  • Medical Image Analysis
  • Kimberly Amador + 4 more

  • Open Access Icon
  • Cite Count Icon 5371
  • 10.1109/tmi.2010.2046908
N4ITK: improved N3 bias correction.
  • Apr 8, 2010
  • IEEE Transactions on Medical Imaging
  • Nicholas J Tustison + 6 more

  • Cite Count Icon 6
  • 10.1007/978-3-030-87586-2_3
Towards Self-explainable Classifiers and Regressors in Neuroimaging with Normalizing Flows
  • Jan 1, 2021
  • Matthias Wilms + 5 more

  • Open Access Icon
  • PDF Download Icon
  • Cite Count Icon 43
  • 10.3389/fneur.2021.648548
Utility of Multi-Modal MRI for Differentiating of Parkinson's Disease and Progressive Supranuclear Palsy Using Machine Learning.
  • Apr 14, 2021
  • Frontiers in neurology
  • Aron S Talai + 3 more

  • Open Access Icon
  • Cite Count Icon 390
  • 10.1016/j.media.2020.101871
Accurate brain age prediction with lightweight deep neural networks.
  • Oct 19, 2020
  • Medical image analysis
  • Han Peng + 4 more

  • Open Access Icon
  • Cite Count Icon 9811
  • 10.1037/1082-989x.7.4.422
Mediation in experimental and nonexperimental studies: New procedures and recommendations.
  • Jan 1, 2002
  • Psychological Methods
  • Patrick E Shrout + 1 more

  • Open Access Icon
  • Cite Count Icon 37
  • 10.1007/s00330-017-5165-5
To share or not to share? Expected pros and cons of data sharing in radiological research
  • Jan 18, 2018
  • European Radiology
  • Francesco Sardanelli + 5 more

  • Open Access Icon
  • Cite Count Icon 57
  • 10.1007/978-3-030-87199-4_30
Style Transfer Using Generative Adversarial Networks for Multi-Site MRI Harmonization.
  • Jan 1, 2021
  • Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention
  • Mengting Liu + 6 more

  • Cite Count Icon 69
  • 10.1016/j.neuroimage.2008.09.053
Sensitivity of voxel-based morphometry analysis to choice of imaging protocol at 3 T
  • Oct 19, 2008
  • NeuroImage
  • Christine L Tardif + 2 more

  • Cite Count Icon 165
  • 10.1038/s41582-019-0294-x
Sex differences in movement disorders.
  • Jan 3, 2020
  • Nature Reviews Neurology
  • Sara Meoni + 2 more

CitationsShowing 10 of 16 papers
  • Research Article
  • 10.1109/access.2025.3563177
HAMF: A Novel Hierarchical Attention-Based Multi-Modal Fusion Model for Parkinson’s Disease Classification and Severity Prediction
  • Jan 1, 2025
  • IEEE Access
  • Anitha Rani Palakayala + 4 more

HAMF: A Novel Hierarchical Attention-Based Multi-Modal Fusion Model for Parkinson’s Disease Classification and Severity Prediction

  • Research Article
  • 10.1016/j.jbi.2025.104873
Accounting for population structure in deep learning models for genomic analysis.
  • Sep 1, 2025
  • Journal of biomedical informatics
  • Gabrielle Dagasso + 3 more

Accounting for population structure in deep learning models for genomic analysis.

  • Research Article
  • Cite Count Icon 2
  • 10.1109/access.2024.3487001
Enhancing Parkinson’s Disease Detection and Diagnosis: A Survey of Integrative Approaches Across Diverse Modalities
  • Jan 1, 2024
  • IEEE Access
  • C R Dhivyaa + 2 more

Parkinson's disease (PD) is a chronic neurodegenerative illness that affects the brain and central nervous system, leading to issues with pain, mobility, mood, and sleep. Early and precise diagnosis is vital for effective medical intervention and enhancing the quality level of life for patients. This review provides an extensive overview of current PD detection methods, emphasizing the integration of neuroimaging techniques, clinical data, and advanced computational algorithms. A wide range of neuroimaging modalities are examined by highlighting their roles in identifying structural and functional brain changes linked to PD. The review also explores the potential of datasets such as handwritten samples, Electroencephalography (EEG), Electrocardiography (ECG), voice recordings, gait analysis, and sensor data for PD detection. The review discusses various stages of data processing, including preprocessing, segmentation, and feature extraction, essential for improving the accuracy and efficiency of diagnostic models. The application of machine learning, deep learning, and transfer learning models for PD classification and prediction is reviewed, focusing on feature selection, model optimization, and the utilization of large, diverse datasets. AI systems can provide reliable and accurate Parkinson's disease (PD) identification, enabling more efficient and prompt therapeutic treatments, by utilizing these sophisticated algorithms and a variety of datasets.

  • Conference Article
  • 10.1117/12.3046811
Does a diffusion-based generative classifier avoid shortcut learning in medical image analysis? An initial investigation using synthetic neuroimaging data
  • Apr 10, 2025
  • Emma Stanley + 2 more

Does a diffusion-based generative classifier avoid shortcut learning in medical image analysis? An initial investigation using synthetic neuroimaging data

  • Book Chapter
  • 10.1007/978-3-031-72787-0_16
Assessing the Impact of Sociotechnical Harms in AI-Based Medical Image Analysis
  • Oct 13, 2024
  • Emma A M Stanley + 8 more

Assessing the Impact of Sociotechnical Harms in AI-Based Medical Image Analysis

  • Research Article
  • 10.1117/1.jmi.11.5.054502
HarmonyTM: multi-center data harmonization applied to distributed learning for Parkinson’s disease classification
  • Sep 1, 2024
  • Journal of Medical Imaging
  • Raissa Souza + 9 more

.PurposeDistributed learning is widely used to comply with data-sharing regulations and access diverse datasets for training machine learning (ML) models. The traveling model (TM) is a distributed learning approach that sequentially trains with data from one center at a time, which is especially advantageous when dealing with limited local datasets. However, a critical concern emerges when centers utilize different scanners for data acquisition, which could potentially lead models to exploit these differences as shortcuts. Although data harmonization can mitigate this issue, current methods typically rely on large or paired datasets, which can be impractical to obtain in distributed setups.ApproachWe introduced HarmonyTM, a data harmonization method tailored for the TM. HarmonyTM effectively mitigates bias in the model’s feature representation while retaining crucial disease-related information, all without requiring extensive datasets. Specifically, we employed adversarial training to “unlearn” bias from the features used in the model for classifying Parkinson’s disease (PD). We evaluated HarmonyTM using multi-center three-dimensional (3D) neuroimaging datasets from 83 centers using 23 different scanners.ResultsOur results show that HarmonyTM improved PD classification accuracy from 72% to 76% and reduced (unwanted) scanner classification accuracy from 53% to 30% in the TM setup.ConclusionHarmonyTM is a method tailored for harmonizing 3D neuroimaging data within the TM approach, aiming to minimize shortcut learning in distributed setups. This prevents the disease classifier from leveraging scanner-specific details to classify patients with or without PD—a key aspect for deploying ML models for clinical applications.

  • Research Article
  • 10.1101/2025.01.15.25320592
Predicting ADC Map Quality from T2-Weighted MRI: A Deep Learning Approach for Early Quality Assessment to Assist Point-of-Care
  • Jan 15, 2025
  • medRxiv
  • Jeffrey R Brender + 10 more

Purpose:Poor quality prostate MRI images, especially ADC maps, can lead to missed lesions and unnecessary repeat scans. To address this issue, we aimed to develop an automated method to predict ADC map quality from T2 images acquired earlier in the scanning process.Materials and Methods:A paired multi-site image corpus of T2-weighted images and ADC maps was constructed from 486 patients imaged in-house and at 62 external clinics. A senior radiologist assigned 1–3 quality ratings to each image set, later converted to a binary “non-diagnostic” or “diagnostic” scale. A deep learning model and a rectal cross-sectional area measurement approach were developed to predict ADC image quality from T2 images. Model performance was evaluated retrospectively by accuracy, sensitivity, negative and positive predictive value, and AUC.Results:No single acquisition parameter in the metadata was statistically associated with image quality for either T2 or ADC maps. Quality scores of the same modality showed low correlation across sites (r~0.2). In the challenging task of predicting ADC quality from prior T2 images, our model achieved performance comparable to current single-site models directly using ADC maps, with 83% sensitivity and 90% negative predictive value. The model showed stronger performance on in-house data (94±2% accuracy) despite being trained exclusively on multicenter external data. Rectal cross-sectional area on T2 images provided an interpretable quality metric (AUC 0.65).Conclusion:The probability of low quality, uninterpretable ADC maps can be inferred early in the imaging process by neural network approach, allowing corrective action to be employed.

  • Research Article
  • 10.1016/j.ejrad.2025.112317
Predicting ADC map quality from T2-weighted MRI: A deep learning approach for early quality assessment to assist point-of-care.
  • Oct 1, 2025
  • European journal of radiology
  • Jeffrey R Brender + 10 more

Predicting ADC map quality from T2-weighted MRI: A deep learning approach for early quality assessment to assist point-of-care.

  • Research Article
  • Cite Count Icon 1
  • 10.3389/fneur.2024.1423485
Analysis and visualization of the effect of multiple sclerosis on biological brain age.
  • Oct 10, 2024
  • Frontiers in neurology
  • Catharina J A Romme + 6 more

The rate of neurodegeneration in multiple sclerosis (MS) is an important biomarker for disease progression but can be challenging to quantify. The brain age gap, which quantifies the difference between a patient's chronological and their estimated biological brain age, might be a valuable biomarker of neurodegeneration in patients with MS. Thus, the aim of this study was to investigate the value of an image-based prediction of the brain age gap using a deep learning model and compare brain age gap values between healthy individuals and patients with MS. A multi-center dataset consisting of 5,294 T1-weighted magnetic resonance images of the brain from healthy individuals aged between 19 and 89 years was used to train a convolutional neural network (CNN) for biological brain age prediction. The trained model was then used to calculate the brain age gap in 195 patients with relapsing remitting MS (20-60 years). Additionally, saliency maps were generated for healthy subjects and patients with MS to identify brain regions that were deemed important for the brain age prediction task by the CNN. Overall, the application of the CNN revealed accelerated brain aging with a larger brain age gap for patients with MS with a mean of 6.98 ± 7.18 years in comparison to healthy test set subjects (0.23 ± 4.64 years). The brain age gap for MS patients was weakly to moderately correlated with age at disease onset (ρ = -0.299, p < 0.0001), EDSS score (ρ = 0.206, p = 0.004), disease duration (ρ = 0.162, p = 0.024), lesion volume (ρ = 0.630, p < 0.0001), and brain parenchymal fraction (ρ = -0.718, p < 0.0001). The saliency maps indicated significant differences in the lateral ventricle (p < 0.0001), insula (p < 0.0001), third ventricle (p < 0.0001), and fourth ventricle (p = 0.0001) in the right hemisphere. In the left hemisphere, the inferior lateral ventricle (p < 0.0001) and the third ventricle (p < 0.0001) showed significant differences. Furthermore, the Dice similarity coefficient showed the highest overlap of salient regions between the MS patients and the oldest healthy subjects, indicating that neurodegeneration is accelerated in this patient cohort. In conclusion, the results of this study show that the brain age gap is a valuable surrogate biomarker to measure disease progression in patients with multiple sclerosis.

  • Research Article
  • Cite Count Icon 2
  • 10.1007/s10278-024-01335-z
Detecting and Mitigating the Clever Hans Effect in Medical Imaging: A Scoping Review.
  • Nov 25, 2024
  • Journal of imaging informatics in medicine
  • Constanza Vásquez-Venegas + 12 more

The Clever Hans effect occurs when machine learning models rely on spurious correlations instead of clinically relevant features and poses significant challenges to the development of reliable artificial intelligence (AI) systems in medical imaging. This scoping review provides an overview of methods for identifying and addressing the Clever Hans effect in medical imaging AI algorithms. A total of 173 papers published between 2010 and 2024 were reviewed, and 37 articles were selected for detailed analysis, with classification into two categories: detection and mitigation approaches. Detection methods include model-centric, data-centric, and uncertainty and bias-based approaches, while mitigation strategies encompass data manipulation techniques, feature disentanglement and suppression, and domain knowledge-driven approaches. Despite the progress in detecting and mitigating the Clever Hans effect, the majority of current machine learning studies in medical imaging do not report or test for shortcut learning, highlighting the need for more rigorous validation and transparency in AI research. Future research should focus on creating standardized benchmarks, developing automated detection tools, and exploring the integration of detection and mitigation strategies to comprehensively address shortcut learning. Establishing community-driven best practices and leveraging interdisciplinary collaboration will be crucial for ensuring more reliable, generalizable, and equitable AI systems in healthcare.

Similar Papers
  • Research Article
  • Cite Count Icon 2
  • 10.4108/eetpht.10.5568
Exploring the Potential of Deep Learning in the Classification and Early Detection of Parkinson's Disease
  • Mar 27, 2024
  • EAI Endorsed Transactions on Pervasive Health and Technology
  • V S Bakkialakshmi + 5 more

INTRODUCTION: Parkinson's Disease (PD) is a progressive neurological disorder affecting a significant portion of the global population, leading to profound impacts on daily life and imposing substantial burdens on healthcare systems. Early identification and precise classification are crucial for effectively managing this disease. This research investigates the potential of deep learning techniques in facilitating early recognition and accurate classification of PD. OBJECTIVES: The primary objective of this study is to leverage advanced deep learning techniques for the early detection and precise classification of Parkinson's Disease. By utilizing a rich dataset comprising speech signal features extracted from 3000 PD patients, including Time Frequency Features, Mel Frequency Cepstral Coefficients (MFCCs), Wavelet Transform based Features, Vocal Fold Features, and TWQT features, this research aims to evaluate the performance of various deep learning models in PD classification. METHODS: The dataset containing diverse speech signal features from PD patients' recordings serves as the foundation for training and evaluating five different deep learning models: ResNet50, VGG16, Inception v2, AlexNet, and VGG19. Each model undergoes training and assessment to determine its capability in accurately classifying PD patients. Performance metrics such as accuracy are employed to evaluate the models' effectiveness. RESULTS: The results demonstrate promising potential, with overall accuracies ranging from 89% to 95% across the different deep learning models. Notably, AlexNet emerges as the top-performing model, achieving an accuracy of 95% and demonstrating balanced performance in accurately identifying both true and false PD cases. CONCLUSION: This research highlights the significant potential of deep learning in facilitating the early detection and classification of Parkinson's Disease. Leveraging speech signal features offers a non-invasive and cost-effective approach to PD assessment. The findings contribute to the growing body of evidence supporting the integration of artificial intelligence in healthcare, particularly in the realm of neurodegenerative disorders. Further exploration into the application of deep learning in this domain holds promise for advancing PD diagnosis and management.

  • Research Article
  • 10.52783/cana.v32.2565
Comparative Analysis of Parkinson's Disease Classification Using Deep Learning Approaches Enhanced with Optimization Techniques Versus Traditional Machine Learning Models
  • Nov 30, 2024
  • Communications on Applied Nonlinear Analysis
  • K Manisekharan

Parkinson's Disease (PD) is a neurodegenerative condition that presents considerable challenges in achieving accurate early diagnosis and classification. This research explores the classification of PD through the application of advanced deep learning techniques integrated with optimization methods, compared to conventional machine learning approaches. The analysis is conducted using a diverse dataset incorporating clinical, vocal, and movement-related features to ensure comprehensive evaluation. Deep learning frameworks, such as multi-layer perceptron (MLP), Long Short-Term Memory (LSDM), the proposed deep learning model namely CNN-BiGRU were enhanced using strategies like hyperparameter optimization, regularization, and advanced gradient-based optimizers to boost performance and minimize overfitting. Similarly, traditional machine learning models, including Linear Regression, Random Tree, REP Tree, and Random Forest, were implemented and tested on the same dataset. Evaluation metrics, including accuracy, precision, recall, F1-score, and the area under the curve (AUC), were used to measure and compare the performance of all models. The findings reveal that optimized deep learning models significantly surpass traditional machine learning methods in both classification accuracy and generalization. This study emphasizes the effectiveness of optimization-enhanced deep learning techniques in PD classification and their clear advantages over traditional models.

  • Research Article
  • 10.32985/ijeces.15.3.6
CCZO Residual GhostNet
  • Jan 1, 2024
  • International journal of electrical and computer engineering systems
  • Arogia Victor Paul M + 1 more

Parkinson's disease (PD) classification plays a crucial role in medical diagnosis and patient management. Identifying Parkinson's disease at an early stage can lead to more effective treatment and improved patient outcomes. However, existing methods for Parkinson's disease classification face several limitations. The foremost limitation is the need for accurate and reliable diagnostic tools, as misdiagnosis can lead to inappropriate treatments and unnecessary stress for patients. Thus, a hybrid deep learning model is introduced in this research. The proposed model involves the utilization of EEG signals obtained from a publicly available dataset. Key features are extracted from the EEG signals using a bandpass filter, and every feature is associated with specific brainwave frequencies and cognitive states. The feature mapping and classification are executed through the Chaotic Chebyshev Zebra optimization- based Residual GhostNet (CCZO_Residual_GhostNet). This hybrid classifier, Residual GhostNet, combines ResNet-152 with GhostNet, enhancing classification precision. Furthermore, the CCZO algorithm optimizes the loss function, introducing elements of chaos and Chebyshev mapping to improve classification accuracy. The assessment based on accuracy, sensitivity, specificity, and F-score acquired 98.76%, 98.59%, 98.95%, and 99%, respectively.

  • Research Article
  • 10.1038/s41598-025-07069-4
Enhanced EfficientNet-Extended Multimodal Parkinson's disease classification with Hybrid Particle Swarm and Grey Wolf Optimizer.
  • Sep 30, 2025
  • Scientific reports
  • K Raajasree + 1 more

Parkinson's disease (PD) is a chronic neurodegenerative disorder characterized by progressive loss of dopaminergic neurons in substantia nigra, resulting in both motor impairments and cognitive decline. Traditional PD classification methods are expert-dependent and time-intensive, while existing deep learning (DL) models often suffer from inconsistent accuracy, limited interpretability, and inability to fully capture PD's clinical heterogeneity. This study proposes a novel framework Enhanced EfficientNet-Extended Multimodal PD Classification with Hybrid Particle Swarm and Grey Wolf Optimizer (EEFN-XM-PDC-HybPS-GWO) to overcome these challenges. The model integrates T1-weighted MRI, DaTscan images, and gait scores from NTUA and PhysioNet repository respectively. Denoising is achieved via Multiscale Attention Variational Autoencoders (MSA-VAE), and critical regions are segmented using Semantic Invariant Multi-View Clustering (SIMVC). The Enhanced EfficientNet-Extended Multimodal (EEFN-XM) model extracts and fuses image and gait features, while HybPS-GWO optimizes classification weights. The system classifies subjects into early-stage PD, advanced-stage PD, and healthy controls (HCs). Ablation analysis confirms the hybrid optimizer's contribution to performance gains. The proposed model achieved 99.2% accuracy with stratified 5-fold cross-validation, outperforming DMFEN-PDC, MMT-CA-PDC, and LSTM-PDD-GS by 7.3%, 15.97%, and 10.43%, respectively, and reduced execution time by 33.33%. EEFN-XM-PDC-HybPS-GWO demonstrates superior accuracy, computational efficiency, and clinical relevance, particularly in early-stage diagnosis and PD classification.

  • Research Article
  • 10.1016/j.patrec.2024.09.026
Explainable hypergraphs for gait based Parkinson classification
  • Oct 1, 2024
  • Pattern Recognition Letters
  • Anirban Dutta Choudhury + 1 more

Explainable hypergraphs for gait based Parkinson classification

  • Research Article
  • Cite Count Icon 12
  • 10.1109/embc44109.2020.9176572
Unsupervised Pre-trained Models from Healthy ADLs Improve Parkinson's Disease Classification of Gait Patterns.
  • Jul 1, 2020
  • Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference
  • Anirudh Som + 3 more

Application and use of deep learning algorithms for different healthcare applications is gaining interest at a steady pace. However, use of such algorithms can prove to be challenging as they require large amounts of training data that capture different possible variations. This makes it difficult to use them in a clinical setting since in most health applications researchers often have to work with limited data. Less data can cause the deep learning model to over-fit. In this paper, we ask how can we use data from a different environment, different use-case, with widely differing data distributions. We exemplify this use case by using single-sensor accelerometer data from healthy subjects performing activities of daily living - ADLs (source dataset), to extract features relevant to multi-sensor accelerometer gait data (target dataset) for Parkinson's disease classification. We train the pre-trained model using the source dataset and use it as a feature extractor. We show that the features extracted for the target dataset can be used to train an effective classification model. Our pretrained source model consists of a convolutional autoencoder, and the target classification model is a simple multi-layer perceptron model. We explore two different pre-trained source models, trained using different activity groups, and analyze the influence the choice of pre-trained model has over the task of Parkinson's disease classification.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 7
  • 10.3389/frai.2024.1301997
A multi-center distributed learning approach for Parkinson's disease classification using the traveling model paradigm
  • Feb 7, 2024
  • Frontiers in Artificial Intelligence
  • Raissa Souza + 7 more

Distributed learning is a promising alternative to central learning for machine learning (ML) model training, overcoming data-sharing problems in healthcare. Previous studies exploring federated learning (FL) or the traveling model (TM) setup for medical image-based disease classification often relied on large databases with a limited number of centers or simulated artificial centers, raising doubts about real-world applicability. This study develops and evaluates a convolution neural network (CNN) for Parkinson's disease classification using data acquired by 83 diverse real centers around the world, mostly contributing small training samples. Our approach specifically makes use of the TM setup, which has proven effective in scenarios with limited data availability but has never been used for image-based disease classification. Our findings reveal that TM is effective for training CNN models, even in complex real-world scenarios with variable data distributions. After sufficient training cycles, the TM-trained CNN matches or slightly surpasses the performance of the centrally trained counterpart (AUROC of 83% vs. 80%). Our study highlights, for the first time, the effectiveness of TM in 3D medical image classification, especially in scenarios with limited training samples and heterogeneous distributed data. These insights are relevant for situations where ML models are supposed to be trained using data from small or remote medical centers, and rare diseases with sparse cases. The simplicity of this approach enables a broad application to many deep learning tasks, enhancing its clinical utility across various contexts and medical facilities.

  • Research Article
  • 10.1016/j.compbiomed.2025.111126
An optimized framework for Parkinson's disease classification using multimodal neuroimaging data with ensemble-based and data fusion networks.
  • Nov 1, 2025
  • Computers in biology and medicine
  • Abdulaziz Alorf

An optimized framework for Parkinson's disease classification using multimodal neuroimaging data with ensemble-based and data fusion networks.

  • Research Article
  • Cite Count Icon 16
  • 10.1155/2022/9276579
Metaheuristics with Deep Learning-Enabled Parkinson's Disease Diagnosis and Classification Model
  • Jan 10, 2022
  • Journal of Healthcare Engineering
  • Adel A Bahaddad + 3 more

Parkinson's disease (PD) affects the movement of people, including the differences in writing skill, speech, tremor, and stiffness in muscles. It is significant to detect the PD at the initial stages so that the person can live a peaceful life for a longer time period. The serious levels of PD are highly risky as the patients get progressive stiffness, which results in the inability of standing or walking. Earlier studies have focused on the detection of PD effectively using voice and speech exams and writing exams. In this aspect, this study presents an improved sailfish optimization algorithm with deep learning (ISFO-DL) model for PD diagnosis and classification. The presented ISFO-DL technique uses the ISFO algorithm and DL model to determine PD and thereby enhances the survival rate of the person. The presented ISFO is a metaheuristic algorithm, which is inspired by a group of hunting sailfish to determine the optimum solution to the problem. Primarily, the ISFO algorithm is applied to derive an optimal subset of features with a fitness function of maximum classification accuracy. At the same time, the rat swarm optimizer (RSO) with the bidirectional gated recurrent unit (BiGRU) is employed as a classifier to determine the existence of PD. The performance validation of the IFSO-DL model takes place using a benchmark Parkinson's dataset, and the results are inspected under several dimensions. The experimental results highlighted the enhanced classification performance of the ISFO-DL technique, and therefore, the proposed model can be employed for the earlier identification of PD.

  • Book Chapter
  • Cite Count Icon 15
  • 10.1007/978-981-16-1249-7_14
Ensemble of Deep Transfer Learning Models for Parkinson's Disease Classification
  • Jul 24, 2021
  • Kiranbabu Rajanbabu + 4 more

Diagnosis is the key step forward to cure a disease. Deep learning is becoming popular as a tool for usage in medical diagnosis. The existing literature using deep learning for the diagnosis of Parkinson’s disease (PD) by transfer learning of MRI data was limited to the AlexNet architecture. The present work aims to inculcate commonly used deep learning architectures using transfer learning for effective diagnosis of PD using MRI data. The best three performing models are selected based on the standard metric called F1-score. An ensemble model is proposed based on the maximum probability across all the selected models for PD classification. The approach mainly focuses on the effective diagnosis of PD. The performance of the proposed ensemble approach is validated using the standard metrics known as F1-score and classification accuracy. Among the commonly used deep learning architectures, it was found that VGG19 is better than the existing state-of-the-art, which is AlexNet. The proposed ensemble approach applied on top three commonly used deep learning models led to the improvement in accuracy to 0.978. The false positive (FP) and false negative (FN) were reduced significantly by the proposed ensemble approach.KeywordsDeep learningTransfer learningEnsemble methodParkinson’s diseaseCNN

  • Research Article
  • Cite Count Icon 14
  • 10.3389/fnagi.2022.808518
Investigating the Impact of Environment and Data Aggregation by Walking Bout Duration on Parkinson's Disease Classification Using Machine Learning.
  • Mar 22, 2022
  • Frontiers in Aging Neuroscience
  • Rana Zia Ur Rehman + 6 more

Parkinson’s disease (PD) is a common neurodegenerative disease. PD misdiagnosis can occur in early stages. Gait impairment in PD is typical and is linked with an increased fall risk and poorer quality of life. Applying machine learning (ML) models to real-world gait has the potential to be more sensitive to classify PD compared to laboratory data. Real-world gait yields multiple walking bouts (WBs), and selecting the optimal method to aggregate the data (e.g., different WB durations) is essential as this may influence classification performance. The objective of this study was to investigate the impact of environment (laboratory vs. real world) and data aggregation on ML performance for optimizing sensitivity of PD classification. Gait assessment was performed on 47 people with PD (age: 68 ± 9 years) and 52 controls [Healthy controls (HCs), age: 70 ± 7 years]. In the laboratory, participants walked at their normal pace for 2 min, while in the real world, participants were assessed over 7 days. In both environments, 14 gait characteristics were evaluated from one tri-axial accelerometer attached to the lower back. The ability of individual gait characteristics to differentiate PD from HC was evaluated using the Area Under the Curve (AUC). ML models (i.e., support vector machine, random forest, and ensemble models) applied to real-world gait showed better classification performance compared to laboratory data. Real-world gait characteristics aggregated over longer WBs (WB 30–60 s, WB > 60 s, WB > 120 s) resulted in superior discriminative performance (PD vs. HC) compared to laboratory gait characteristics (0.51 ≤ AUC ≤ 0.77). Real-world gait speed showed the highest AUC of 0.77. Overall, random forest trained on 14 gait characteristics aggregated over WBs > 60 s gave better performance (F1 score = 77.20 ± 5.51%) as compared to laboratory results (F1 Score = 68.75 ± 12.80%). Findings from this study suggest that the choice of environment and data aggregation are important to achieve maximum discrimination performance and have direct impact on ML performance for PD classification. This study highlights the importance of a harmonized approach to data analysis in order to drive future implementation and clinical use.Clinical Trial Registration[09/H0906/82].

  • Research Article
  • Cite Count Icon 2
  • 10.1149/10701.17671ecst
Performance Enhancement of the Parkinson's Disease Classification Using Deep Learning Model
  • Apr 24, 2022
  • ECS Transactions
  • Dinesh Kumar, D + 3 more

Parkinson's Disease is a neurological condition that affects the brain. It causes tremors in the body, hands, and spine, and causes the body to stiffen. A cure or treatment is not yet available, even though the condition is quite advanced. It is only possible to treat a disease when it is in its early stages or onset. As well as cutting the disease's costs, these measures could also save a person's life. In this paper, the study will be on Parkinson's Disease and the capsule network will be used for classification of the Parkinson's Disease. Dataset used for the analysis is downloaded from the Physionet, which consists of images. Research presented here focuses on applying deep learning models to fully understand Parkinson's Disease and identify its earliest signs. In order to assess the model, we look at its precision, recall, and other related metrics. The result section has shown that the capsule network has performed better than the other existing algorithms.

  • Research Article
  • Cite Count Icon 24
  • 10.1016/j.compbiomed.2023.107031
Diagnosis and classification of Parkinson's disease using ensemble learning and 1D-PDCovNN
  • May 17, 2023
  • Computers in Biology and Medicine
  • Majid Nour + 2 more

Diagnosis and classification of Parkinson's disease using ensemble learning and 1D-PDCovNN

  • Research Article
  • Cite Count Icon 13
  • 10.1093/jamia/ocad171
Image-encoded biological and non-biological variables may be used as shortcuts in deep learning models trained on multisite neuroimaging data
  • Sep 5, 2023
  • Journal of the American Medical Informatics Association : JAMIA
  • Raissa Souza + 6 more

ObjectiveThis work investigates if deep learning (DL) models can classify originating site locations directly from magnetic resonance imaging (MRI) scans with and without correction for intensity differences.Material and MethodsA large database of 1880 T1-weighted MRI scans collected across 41 sites originally for Parkinson’s disease (PD) classification was used to classify sites in this study. Forty-six percent of the datasets are from PD patients, while 54% are from healthy participants. After preprocessing the T1-weighted scans, 2 additional data types were generated: intensity-harmonized T1-weighted scans and log-Jacobian deformation maps resulting from nonlinear atlas registration. Corresponding DL models were trained to classify sites for each data type. Additionally, logistic regression models were used to investigate the contribution of biological (age, sex, disease status) and non-biological (scanner type) variables to the models’ decision.ResultsA comparison of the 3 different types of data revealed that DL models trained using T1-weighted and intensity-harmonized T1-weighted scans can classify sites with an accuracy of 85%, while the model using log-Jacobian deformation maps achieved a site classification accuracy of 54%. Disease status and scanner type were found to be significant confounders.DiscussionOur results demonstrate that MRI scans encode relevant site-specific information that models could use as shortcuts that cannot be removed using simple intensity harmonization methods.ConclusionThe ability of DL models to exploit site-specific biases as shortcuts raises concerns about their reliability, generalization, and deployability in clinical settings.

  • Research Article
  • 10.1080/0954898x.2025.2514187
Hybrid optimization enabled Eff-FDMNet for Parkinson’s disease detection and classification in federated learning
  • Aug 2, 2025
  • Network: Computation in Neural Systems
  • Sangeetha Subramaniam + 1 more

Parkinson’s Disease (PD) is a progressive neurodegenerative disorder and the early diagnosis is crucial for managing symptoms and slowing disease progression. This paper proposes a framework named Federated Learning Enabled Waterwheel Shuffled Shepherd Optimization-based Efficient-Fuzzy Deep Maxout Network (FedL_WSSO based Eff-FDMNet) for PD detection and classification. In local training model, the input image from the database “Image and Data Archive (IDA)” is given for preprocessing that is performed using Gaussian filter. Consequently, image augmentation takes place and feature extraction is conducted. These processes are executed for every input image. Therefore, the collected outputs of images are used for PD detection using Shepard Convolutional Neural Network Fuzzy Zeiler and Fergus Net (ShCNN-Fuzzy-ZFNet). Then, PD classification is accomplished using Eff-FDMNet, which is trained using WSSO. At last, based on CAViaR, local updation and aggregation are changed in server. The developed method obtained highest accuracy as 0.927, mean average precision as 0.905, lowest false positive rate (FPR) as 0.082, loss as 0.073, Mean Squared Error (MSE) as 0.213, and Root Mean Squared Error (RMSE) as 0.461. The high accuracy and low error rates indicate that the potent framework can enhance patient outcomes by enabling more reliable and personalized diagnosis.

More from: IEEE journal of biomedical and health informatics
  • New
  • Research Article
  • 10.1109/jbhi.2025.3630249
Dual-Branch Attention-based Frequency Domain Network for Cross-subject SSVEP-BCIs.
  • Nov 7, 2025
  • IEEE journal of biomedical and health informatics
  • Yi Yang + 8 more

  • New
  • Research Article
  • 10.1109/jbhi.2025.3629974
Detecting Driver Sleepiness From Physiological Indicators Using a CNN-LSTM Self-Attention Model.
  • Nov 7, 2025
  • IEEE journal of biomedical and health informatics
  • Yingying Jiao + 3 more

  • Research Article
  • 10.1109/jbhi.2025.3616477
IEEE Journal of Biomedical and Health Informatics Publication Information
  • Nov 1, 2025
  • IEEE Journal of Biomedical and Health Informatics

  • Research Article
  • 10.1109/jbhi.2025.3616479
Table of Contents
  • Nov 1, 2025
  • IEEE Journal of Biomedical and Health Informatics

  • Research Article
  • 10.1109/jbhi.2025.3573272
An Improved Microbial Object Detection Method for Low-Contrast and Occluded Scenarios Based on SMA-YOLOv8s.
  • Nov 1, 2025
  • IEEE journal of biomedical and health informatics
  • Kun Sun + 6 more

  • Research Article
  • 10.1109/jbhi.2025.3616475
Front Cover
  • Nov 1, 2025
  • IEEE Journal of Biomedical and Health Informatics

  • Research Article
  • 10.1109/jbhi.2025.3616481
IEEE Journal of Biomedical and Health Informatics Information for Authors
  • Nov 1, 2025
  • IEEE Journal of Biomedical and Health Informatics

  • Research Article
  • 10.1109/jbhi.2025.3626200
Bilateral Information-Guided Diagnosis of Breast Masses in Mammography Using Vision Transformer.
  • Oct 31, 2025
  • IEEE journal of biomedical and health informatics
  • Tianyu Zeng + 8 more

  • Research Article
  • 10.1109/jbhi.2025.3626332
A Novel Binocular-Encoded SSVEP Framework for Efficient VR-Based Brain-Computer Interface.
  • Oct 31, 2025
  • IEEE journal of biomedical and health informatics
  • Haifeng Liu + 6 more

  • Research Article
  • 10.1109/jbhi.2025.3600227
Deep learning-based stride segmentation with wearable sensors: effects of data quantity, sensor location, and task.
  • Oct 31, 2025
  • IEEE journal of biomedical and health informatics
  • Anthony J Anderson + 8 more

Save Icon
Up Arrow
Open/Close
  • Ask R Discovery Star icon
  • Chat PDF Star icon
Setting-up Chat
Loading Interface