Psychiatric disorders are among the most common and debilitating illnesses across the lifespan and begin usually during childhood and adolescence, which emphasizes the importance of studying the developing brain. Most of the previous pediatric neuroimaging studies employed traditional univariate statistics on relatively small samples. Multivariate machine learning approaches have a great potential to overcome the limitations of these approaches. On the other hand, the vast majority of existing multivariate machine learning studies have focused on differentiating between children with an isolated psychiatric disorder and typically developing children. However, this line of research does not reflect the real-life situation as the majority of children with a clinical diagnosis have multiple psychiatric disorders (multimorbidity), and consequently, a clinician has the task to choose between different diagnoses and/or the combination of multiple diagnoses. Thus, the goal of the present benchmark is to predict psychiatric multimorbidity in children and adolescents. For this purpose, we implemented two kinds of machine learning benchmark challenges: The first challenge targets the prediction of the seven most prevalent DSM-V psychiatric diagnoses for the available data set, of which each individual can exhibit multiple ones concurrently (i.e. multi-task multi-label classification). Based on behavioral and cognitive measures, a second challenge focuses on predicting psychiatric symptom severity on a dimensional level (i.e. multiple regression task). For the present benchmark challenges, we will leverage existing and future data from the biobank of the Healthy Brain Network (HBN) initiative, which offers a unique large-sample dataset (N = 2042) that provides a wide array of different psychiatric developmental disorders and true hidden data sets. Due to limited real-world practicability and economic viability of MRI measurements, the present challenge will permit only resting state EEG data and demographic information to derive predictive models. We believe that a community driven effort to derive predictive markers from these data using advanced machine learning algorithms can help to improve the diagnosis of psychiatric developmental disorders.
Read full abstract