Machine Learning Prediction of Autism Spectrum Disorder From a Minimal Set of Medical and Background Information

Shyam Sundar Rajagopalan, Yali Zhang, Ashraf Yahia, Kristiina Tammimies

doi:10.1001/jamanetworkopen.2024.29229

Abstract

Early identification of the likelihood of autism spectrum disorder (ASD) using minimal information is crucial for early diagnosis and intervention, which can affect developmental outcomes. The objective of this study was to develop and validate a machine learning (ML) model for predicting ASD using a minimal set of features from background and medical information and to evaluate the predictors and the utility of the ML model.

For this diagnostic study, a retrospective analysis of the Simons Foundation Powering Autism Research for Knowledge (SPARK) database, version 8 (released June 6, 2022), was conducted, including data from 30 660 participants after adjustments for missing values and class imbalances (15 330 with ASD and 15 330 without ASD). The SPARK database contains participants recruited from 31 university-affiliated research clinics and online in 26 states in the US. All individuals with a professional ASD diagnosis and their families were eligible to participate. The model performance was validated on independent datasets from SPARK, version 10 (released July 21, 2023), and the Simons Simplex Collection (SSC), consisting of 14 790 participants, followed by phenotypic associations. The exposures were 28 basic medical screening and background history items present before 24 months of age.

Generalizable ML prediction models were developed for detecting ASD using 4 algorithms (logistic regression, decision tree, random forest, and eXtreme Gradient Boosting [XGBoost]). Performance metrics included accuracy, area under the receiver operating characteristic curve (AUROC), sensitivity, specificity, positive predictive value (PPV), and F1 score, offering a comprehensive assessment of the predictive accuracy of the model. As secondary outcomes, explainable AI methods were applied to determine the effect of individual features in predicting ASD, enhancing the interpretability of the best-performing model. The secondary outcome analyses were further complemented by examining differences in various phenotypic measures using nonparametric statistical methods, providing insights into the ability of the model to differentiate between different presentations of ASD.

The study included 19 477 (63.5%) male and 11 183 (36.5%) female participants (mean [SD] age, 106 [62] months). The mean (SD) age was 113 (68) months for the ASD group and 100 (55) months for the non-ASD group. The XGBoost model (termed AutMedAI) demonstrated strong performance, with an AUROC of 0.895, sensitivity of 0.805, specificity of 0.829, and PPV of 0.897. Developmental milestones and eating behavior were the most important predictors. Validation on independent cohorts showed an AUROC of 0.790, indicating good generalizability.

In this diagnostic study of ML prediction of ASD, robust model performance was observed, with the model identifying autistic individuals with more symptoms and lower cognitive levels. The robustness and generalizability of the ML model are promising for further validation and use in clinical and population settings.
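The performance metrics named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' code: the class and function names (`ConfusionCounts`, `classification_metrics`, `auroc`) are hypothetical, and the counts and scores are made-up toy values, not study data. It only shows the standard definitions of accuracy, sensitivity, specificity, PPV, and F1 score from a confusion matrix, plus AUROC computed as the probability that a randomly chosen positive case scores higher than a randomly chosen negative case.

```python
from dataclasses import dataclass


@dataclass
class ConfusionCounts:
    """Counts from a binary confusion matrix (hypothetical helper type)."""
    tp: int  # true positives
    fp: int  # false positives
    tn: int  # true negatives
    fn: int  # false negatives


def classification_metrics(c: ConfusionCounts) -> dict:
    """Standard threshold-based metrics from confusion-matrix counts."""
    sensitivity = c.tp / (c.tp + c.fn)   # recall on the positive class
    specificity = c.tn / (c.tn + c.fp)   # recall on the negative class
    ppv = c.tp / (c.tp + c.fp)           # positive predictive value (precision)
    accuracy = (c.tp + c.tn) / (c.tp + c.fp + c.tn + c.fn)
    f1 = 2 * ppv * sensitivity / (ppv + sensitivity)  # harmonic mean of PPV and sensitivity
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "ppv": ppv,
        "f1": f1,
    }


def auroc(labels: list, scores: list) -> float:
    """AUROC via pairwise comparison; tied scores count as half a win.

    Quadratic in the number of samples, so suitable only for small examples.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy values for illustration only (not taken from the study).
toy_metrics = classification_metrics(ConfusionCounts(tp=8, fp=1, tn=9, fn=2))
toy_auroc = auroc([1, 1, 0, 0], [0.9, 0.4, 0.5, 0.1])
print(toy_metrics)
print(toy_auroc)
```

Sensitivity, specificity, PPV, and F1 depend on the decision threshold used to turn model scores into class labels, whereas AUROC summarizes ranking quality across all thresholds, which is one reason to report both kinds of metric together.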

Journal: JAMA Network Open
Publication Date: Aug 19, 2024
License type: CC BY

Similar Papers

Machine learning determination of applied behavioral analysis treatment plan type
Jenish Maharjan ... Ella Browning
Brain Informatics | VOL. 10
02 Mar 2023

Development and validation of a machine-learning model for prediction of shoulder dystocia.
A Tsur ... Y Brezinov
Ultrasound in Obstetrics & Gynecology | VOL. 56
01 Oct 2020

Machine learning predictive model for aspiration screening in hospitalized patients with acute stroke
Dougho Park ... Mun-Chul Kim
Scientific Reports | VOL. 13
15 May 2023

Machine learning-based prediction of in-ICU mortality in pneumonia patients
Eun-Tae Jeon ... Kwang Nam Jin
Scientific Reports | VOL. 13
17 Jul 2023
