Short tandem repeats (STRs) are repetitive DNA sequences and highly mutable in various human disorders. While the involvement of STRs in various genetic disorders has been extensively studied, their role in autism spectrum disorder (ASD) remains largely unexplored. In this study, we aimed to investigate genetic association of STR expansions with ASD using whole genome sequencing (WGS) and identify risk loci associated with ASD phenotypes. We analyzed WGS data of 634 ASD families and performed genome-wide evaluation for 12,929 STR loci. We found rare STR expansions that exceeded normal repeat lengths in autism cases compared to unaffected controls. By integrating single cell RNA and ATAC sequencing datasets of human postmortem brains, we prioritized STR loci in genes specifically expressed in cortical development stages. A deep learning method was used to predict functionality of ASD-associated STR loci. In ASD cases, rare STR expansions predominantly occurred in early cortical layer-specific genes involved in neurodevelopment, highlighting the cellular specificity of STR-associated genes in ASD risk. Leveraging deep learning prediction models, we demonstrated that these STR expansions disrupted the regulatory activity of enhancers and promoters, suggesting a potential mechanism through which they contribute to ASD pathogenesis. We found that individuals with ASD-associated STR expansions exhibited more severe ASD phenotypes and diminished adaptability compared to non-carriers. Short tandem repeat expansions in cortical layer-specific genes are associated with ASD and could potentially be a risk genetic factor for ASD. Our study is the first to show evidence of STR expansion associated with ASD in an under-investigated population.
Read full abstract