Short tandem repeats (STRs) are abundant and have high mutation rates across cattle genomes; however, comprehensive exploration of cattle STRs is needed. Here, we constructed a comprehensive map of 467 553 polymorphic STRs (pSTRs) constructed from 423 cattle genomes representing 59 breeds worldwide. We observed that pSTRs in coding sequences and 5'UTRs (Untranslated Regions) were under strong selective constraints and exhibited a relatively low level of diversity. Furthermore, we found that these pSTRs underwent more contraction than expansion. Population analysis showed a strong positive correlation (R = 1) between pSTR diversity and single nucleotide polymorphic heterozygosity. We also investigated STR differences between taurine and indicine cattle and detected 2301 highly divergent STRs, which might relate to immune, endocrine and neurodevelopmental pathways. In summary, our large-scale study characterizes the spectrum of STRs in cattle, expands the scale of known cattle STR variation and provides novel insights into differences among various cattle subspecies.
Read full abstract