Abstract

BackgroundMammalian centromeres are satellite-rich chromatin domains that execute conserved roles in kinetochore assembly and chromosome segregation. Centromere satellites evolve rapidly between species, but little is known about population-level diversity across these loci.ResultsWe developed a k-mer based method to quantify centromere copy number and sequence variation from whole genome sequencing data. We applied this method to diverse inbred and wild house mouse (Mus musculus) genomes to profile diversity across the core centromere (minor) satellite and the pericentromeric (major) satellite repeat. We show that minor satellite copy number varies more than 10-fold among inbred mouse strains, whereas major satellite copy numbers span a 3-fold range. In contrast to widely held assumptions about the homogeneity of mouse centromere repeats, we uncover marked satellite sequence heterogeneity within single genomes, with diversity levels across the minor satellite exceeding those at the major satellite. Analyses in wild-caught mice implicate subspecies and population origin as significant determinants of variation in satellite copy number and satellite heterogeneity. Intriguingly, we also find that wild-caught mice harbor dramatically reduced minor satellite copy number and elevated satellite sequence heterogeneity compared to inbred strains, suggesting that inbreeding may reshape centromere architecture in pronounced ways.ConclusionTaken together, our results highlight the power of k-mer based approaches for probing variation across repetitive regions, provide an initial portrait of centromere variation across Mus musculus, and lay the groundwork for future functional studies on the consequences of natural genetic variation at these essential chromatin domains.

Highlights

  • Mammalian centromeres are satellite-rich chromatin domains that execute conserved roles in kinetochore assembly and chromosome segregation

  • The Centromere protein A (CENP-A) amino acid sequence is perfectly conserved among M. musculus subspecies, but sequence diversity at the centromere satellite could influence the efficiency of CENP-A binding, with potential downstream consequences for kinetochore assembly and chromosome segregation [22, 56]

  • Our study presents the largest survey of population-level centromere variation in mammals to date, encompassing both inbred and wild mice from three cardinal M. musculus subspecies

Read more

Summary

Introduction

Mammalian centromeres are satellite-rich chromatin domains that execute conserved roles in kinetochore assembly and chromosome segregation. Centromeres are chromatin domains that are essential for chromosome segregation and the maintenance of genome stability [1,2,3,4]. As a consequence of this satellite-rich architecture, centromeres are predisposed to high rates of structural mutation via replication slippage, unequal exchange, and transposition [9]. These processes actively contribute to satellite repeat size and sequence variability. The remarkable size and sequence variability of centromeres, combined with their critical and highly conserved cellular roles in chromosome segregation and genome stability, impose an enduring biological paradox

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call