Copy number variation (CNV) represents a major source of genomic variation. We investigated the diversity of CNV distribution using SNP array data collected from a comprehensive collection of geographically dispersed sheep breeds. We identified 24,558 putative CNVs, which can be merged into 619 CNV regions, spanning 197Mb of total length and corresponding to ~6.9% of the sheep genome. Our results reveal a population differentiation in CNV between different geographical areas, including Africa, America, Asia, Southwestern Asia, Central Europe, Northern Europe and Southwestern Europe. We observed clear distinctions in CNV prevalence between diverse groups, possibly reflecting the population history of different sheep breeds. We sought to determine the gene content of CNV, and found several important CNV-overlapping genes (BTG3, PTGS1 and PSPH) which were involved in fetal muscle development, prostaglandin (PG) synthesis, and bone color. Our study generates a comprehensive CNV map, which may contribute to genome annotation in sheep.
Read full abstract