Abstract

Individuals with Parkinson’s disease present with a complex clinical phenotype, encompassing sleep, motor, cognitive, and affective disturbances. However, characterizations of PD are typically made for the “average” patient, ignoring patient heterogeneity and obscuring important individual differences. Modern large-scale data sharing efforts provide a unique opportunity to precisely investigate individual patient characteristics, but there exists no analytic framework for comprehensively integrating data modalities. Here we apply an unsupervised learning method—similarity network fusion—to objectively integrate MRI morphometry, dopamine active transporter binding, protein assays, and clinical measurements from n = 186 individuals with de novo Parkinson’s disease from the Parkinson’s Progression Markers Initiative. We show that multimodal fusion captures inter-dependencies among data modalities that would otherwise be overlooked by field standard techniques like data concatenation. We then examine how patient subgroups derived from the fused data map onto clinical phenotypes, and how neuroimaging data is critical to this delineation. Finally, we identify a compact set of phenotypic axes that span the patient population, demonstrating that this continuous, low-dimensional projection of individual patients presents a more parsimonious representation of heterogeneity in the sample compared to discrete biotypes. Altogether, these findings showcase the potential of similarity network fusion for combining multimodal data in heterogeneous patient populations.

Highlights

  • Individuals with Parkinson’s disease (PD) present with a range of symptoms, including sleep, motor, cognitive, and affective disturbances[1]

  • Cortical thickness and subcortical volume are both derived from the same data modality, they are estimated using different algorithms so we retain them as separate sources

  • SNF appears much more stable to changes in data dimensionality, with NMI scores and modularity estimates more evenly distributed across data modalities (NMI = 0.15 ± 0.18 [0.01–0.50]; modularity = 0.05 ± 0.06 [0.00–0.17]). While this does not necessarily imply that the generated clustering assignments are meaningful, it does suggest that SNF provides a more balanced representation of the input data than simple concatenation, opening the door for a more holistic assessment of their data included in SNF, we found significant differentiation of PD-ICA atrophy scores between biotypes (F(2, 183) = 5.70, p = 0.004), largely driven by lower atrophy in the “intermediate” compared to the “mild” group

Read more

Summary

Introduction

Individuals with Parkinson’s disease (PD) present with a range of symptoms, including sleep, motor, cognitive, and affective disturbances[1]. Most attempts to resolve heterogeneity in PD rely on clustering or subtyping of patients based solely on clinical-behavioral assessments. While these efforts have shown that it is possible to stratify patients into clinically meaningful categories with considerable predictive utility[4,5]; cf., clinical measures do not directly measure the underlying pathophysiology of PD7. There exist many families of techniques for investigating multi-view data, including multiple kernel learning, matrix factorization, and deep learning, that have been increasingly used in recent years to tackle issues of data integration[25]. The most commonly employed technique—to concatenate data modalities—

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call