Abstract

Genomic structural variations represent an important source of genetic variation in mammal genomes, thus, they are commonly related to phenotypic expressions. In this work, ∼770,000 single nucleotide polymorphism genotypes from 506 animals from 19 cattle breeds were analyzed. A simple LD-based structural variation was defined, and a genome-wide analysis was performed. After applying some quality control filters, for each breed and each chromosome we calculated the linkage disequilibrium (r 2) of short range (≤100 Kb). We sorted SNP pairs by distance and obtained a set of LD means (called the expected means) using bins of 5 Kb. We identified 15,246 segments of at least 1 Kb, among the 19 breeds, consisting of sets of at least 3 adjacent SNPs so that, for each SNP, r 2 within its neighbors in a 100 Kb range, to the right side of that SNP, were all bigger than, or all smaller than, the corresponding expected mean, and their P-value were significant after a Benjamini-Hochberg multiple testing correction. In addition, to account just for homogeneously distributed regions we considered only SNPs having at least 15 SNP neighbors within 100 Kb. We defined such segments as structural variations. By grouping all variations across all animals in the sample we defined 9,146 regions, involving a total of 53,137 SNPs; representing the 6.40% (160.98 Mb) from the bovine genome. The identified structural variations covered 3,109 genes. Clustering analysis showed the relatedness of breeds given the geographic region in which they are evolving. In summary, we present an analysis of structural variations based on the deviation of the expected short range LD between SNPs in the bovine genome. With an intuitive and simple definition based only on SNPs data it was possible to discern closeness of breeds due to grouping by geographic region in which they are evolving.

Highlights

  • It is well known that among human beings, 99.9% of DNA sequence is identical [1], but, differences contribute to genetic variations among people

  • Polymorphic proportions and Linkage Disequilibrium The filtered data set included an average of 523,651 genotypes per breed, among the 29 autosomal chromosomes in 506 animals from 19 breeds (See Methods section for a description of quality control filters, Table S2 in File S2 for details of total markers in each breed and chromosomal separately, and Table S1 in File S2 for details on breeds descriptions and sample numbers)

  • To characterize SNPs, we determined the distribution of minor allele frequencies (MAF) in each of the 19 breeds

Read more

Summary

Introduction

It is well known that among human beings, 99.9% of DNA sequence is identical [1], but, differences contribute to genetic variations among people. Studies on the human genome were limited to those that could be identified through a microscope. Scientists were able to characterize smaller differences, as well as abundant alterations in short DNA sequences, typically smaller than 1 Kb. In recent years, comparative sequence analyses have revealed DNA variation that involves segments that are smaller than those recognized microscopically, but larger than those readily detected by conventional sequence analysis. Comparative sequence analyses have revealed DNA variation that involves segments that are smaller than those recognized microscopically, but larger than those readily detected by conventional sequence analysis These variations range from ,1 Kb to 3 Mb and consist on insertions, deletions, inversions and duplications and may even contain whole genes [2]. A structural variation is designated as structural abnormality when it is established that it can, by itself, or in combination with other environmental factors, be the cause of a genetic disease or a phenotypic expression

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.