Abstract
At least 40 human diseases are associated with repeat expansions; yet, the mutational origin and instability mechanisms remain unknown for most of them. Previously, genetic epidemiology and predisposing backgrounds for the instability of some expanding loci have been studied in different populations through the analysis of diversity flanking the respective pathogenic repeats. Here, we aimed at developing a pipeline to assess disease-associated haplotypes at oligonucleotide repeat loci, combining analysis of single nucleotide polymorphisms (SNPs) and short tandem repeats (STRs). Machado-Joseph disease (MJD/SCA3), the most frequent dominant ataxia worldwide, was used as an example of a detailed procedure. Thus, to identify genetic backgrounds that segregate with expanded/mutated alleles in MJD, we selected a set of 26 SNPs and 7 STRs flanking the causative CAG repeat. Key criteria and steps for this selection are described, and included (1) haplotype blocks minimizing the occurrence of recombination (for SNPs); and (2) match scores to increase potential for polymorphic information content of repetitive sequences found in Tandem Repeats Finder (for STRs). To directly assess SNP haplotypes in phase with MJD expansions, we optimized a strategy with preferential amplification of normal over expanded alleles, in addition to SNP allele-specific amplifications; this allowed the identification of disease-associated SNP haplotypes, even when only the proband is available in a given family. To infer STR haplotypes, we optimized a multiplex PCR, including 7 STRs plus the MJD_CAG repeat, followed by analysis of segregation or the use of the PHASE software. This protocol is a ready-to-use tool to assess MJD haplotypes in different populations. The pipeline designed can be used to assess disease-associated haplotypes in other repeat-expansion diseases. This should be of great utility to study (1) genetic epidemiology (population-of-origin, age and spreading routes of mutations) and (2) mechanisms responsible for de novo expansions, in these neurological diseases; (3) to detect predisposing haplotypes and (4) phenotype modifiers; (5) to help solving cases of apparent homoallelism (two same-size normal alleles) in diagnosis; and (6) to identify the best targets for the development of allele-specific therapies in ethnically diverse patient populations.
Highlights
Repetitive DNA sequences with a capacity to expand are found in the human genome in non-coding and exonic regions
Genotypes of Single nucleotide polymorphisms (SNPs) and Short tandem repeats (STRs) were obtained in 92 families, a successful rate for SNP and STR amplification of 92% - taking into account the long-term storage of most DNA samples analyzed, we considered 8% a low failure rate
We have estimated success rate for haplotype inference, given the importance of assessing allelic phase of polymorphisms segregating with the pathogenic expansion: following our strategy of SNP allele-specific amplification, MJD lineages were accurately identified in all 92 families; as for flanking STRs, we were able to reconstruct haplotypes in 85% of the families genotyped (78/92); of note is the fact that 48% (44/92) of the families were composed solely by the proband and that in other 15% (14/92) only a single relative was available
Summary
Repetitive DNA sequences with a capacity to expand (sometimes up to hundreds or thousands of repeats) are found in the human genome in non-coding and exonic regions. They are currently known to be associated with approximately 40 human diseases (reviewed in Paulson, 2018). Among them are the spinocerebellar ataxias (SCA1, DRPLA, SCA2, MJD/SCA3, SCA6, SCA7, SCA8, SCA10, SCA12, SCA17, SCA31, and SCA36), Huntington’s disease, myotonic dystrophy, spinal-bulbar muscular atrophy and fragile X syndrome While each of these diseases is rare worldwide, together they are one of the commonest causes of hereditary neurological pathology (Sequeiros et al, 2011; Paulson, 2018). Given the importance of haplotype analyses (including SNPs and STRs) to perform a comprehensive study of oligonucleotide repeat-related diseases, we designed a strategy to identify diseaseassociated haplotypes and show here the example of this approach to analyse the ATXN3 locus
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have