Abstract
BackgroundParentage verification by molecular markers is mainly based on short tandem repeat markers. Single nucleotide polymorphisms (SNPs) as bi-allelic markers have become the markers of choice for genotyping projects. Thus, the subsequent step is to use SNP genotypes for parentage verification as well. Recent developments of algorithms such as evaluating opposing homozygous SNP genotypes have drawbacks, for example the inability of rejecting all animals of a sample of potential parents. This paper describes an algorithm for parentage verification by constrained regression which overcomes the latter limitation and proves to be very fast and accurate even when the number of SNPs is as low as 50. The algorithm was tested on a sample of 14,816 animals with 50, 100 and 500 SNP genotypes randomly selected from 40k genotypes. The samples of putative parents of these animals contained either five random animals, or four random animals and the true sire. Parentage assignment was performed by ranking of regression coefficients, or by setting a minimum threshold for regression coefficients. The assignment quality was evaluated by the power of assignment (P_{text {a}}) and the power of exclusion (P_{text {e}}).ResultsIf the sample of putative parents contained the true sire and parentage was assigned by coefficient ranking, P_{text {a}} and P_{text {e}} were both higher than 0.99 for the 500 and 100 SNP genotypes, and higher than 0.98 for the 50 SNP genotypes. When parentage was assigned by a coefficient threshold, P_{text {e}} was higher than 0.99 regardless of the number of SNPs, but P_{text {a}} decreased from 0.99 (500 SNPs) to 0.97 (100 SNPs) and 0.92 (50 SNPs). If the sample of putative parents did not contain the true sire and parentage was rejected using a coefficient threshold, the algorithm achieved a P_{text {e}} of 1 (500 SNPs), 0.99 (100 SNPs) and 0.97 (50 SNPs).ConclusionThe algorithm described here is easy to implement, fast and accurate, and is able to assign parentage using genomic marker data with a size as low as 50 SNPs.
Highlights
Parentage verification by molecular markers is mainly based on short tandem repeat markers
When 500 randomly selected Single nucleotide polymorphisms (SNPs) were used as genotypes and the sample of putative parents contained the true sire, power of assignment (Pa) of all algorithms was higher than 0.99 with marginal differences between the methods
Results show that parentage can be successfully assigned with as few as 500 SNPs using CGRR, CGRT, opposing homozygous marker loci (OHL) counting or a likelihood-based method, and for which all four approaches perform well
Summary
Parentage verification by molecular markers is mainly based on short tandem repeat markers. For the last two decades, this verification was based on short tandem repeat markers (STR), which are commonly called micro-satellites, are highly polymorphic and allow discrimination between individuals even when the total number of markers used is small. Due to their highly polymorphic character, parentage assignment on the basis of STR can in the last decade bi-allelic single nucleotide polymorphisms (SNPs) have quickly become the marker of choice for genotyping projects. SNP genotypes have become the backbone of genomic selection which is replacing pedigree selection in many livestock industries [2]
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.