Abstract

Abstract Differentially Expressed Genes (DEGs) are treated as candidate biomarkers, and a small set of DEGs might be identified as biomarkers using either biological knowledge or data-driven approaches like machine learning and statistical analysis. In this study, we used a combination of the machine learning algorithm and statistical tests to identify the top 300 genes that are differentially expressed in ASD compared to Typically Developed (TD). Initially, we extracted microarray gene expression data of 15 ASD and 15 TD from NCBI GEO database and used a standard pipeline to preprocess the data. Further, Random Forest (RF) was used to discriminate genes between ASD and TD. We, then analyzed the upregulated and downregulated genes using the logFC value to gain insights into their potential roles in the development of ASD. We further used drug-gene interaction analysis from ConnectivityMap to identify drugs that can inhibit the expression of these genes. Our results show that the proposed RF model yields average 5-fold cross-validation accuracy, sensitivity, and specificity of 96.67%. Further, we obtained precision and F-measure scores of 97.5% and 96.57%, respectively. Our analysis identified several novel genes that are dysregulated in ASD, including genes (such as proliferation-inducing protein 38 and germinal centre expressed transcript 32) involved in synaptic transmission, neural development, and immune function. We also identified several drugs (such as ATPase_Inhibitor, kinase inhibitors, and histone deacetylase inhibitors) that can potentially be used to treat ASD. Our findings provide new insights into the molecular mechanisms of ASD and suggest potential targets for drug development. These findings may lead to new therapeutic approaches for the treatment of ASD.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call