Abstract

BackgroundGenomic evaluation is used to predict direct genomic values (DGV) for selection candidates in breeding programs, but also to estimate allele substitution effects (ASE) of single nucleotide polymorphisms (SNPs). Scaling of allele counts influences the estimated ASE, because scaling of allele counts results in less shrinkage towards the mean for low minor allele frequency (MAF) variants. Scaling may become relevant for estimating ASE as more low MAF variants will be used in genomic evaluations. We show the impact of scaling on estimates of ASE using real data and a theoretical framework, and in terms of power, model fit and predictive performance.ResultsIn a dairy cattle dataset with 630 K SNP genotypes, the correlation between DGV for stature from a random regression model using centered allele counts (RRc) and centered and scaled allele counts (RRcs) was 0.9988, whereas the overall correlation between ASE using RRc and RRcs was 0.27. The main difference in ASE between both methods was found for SNPs with a MAF lower than 0.01. Both the ratio (ASE from RRcs/ASE from RRc) and the regression coefficient (regression of ASE from RRcs on ASE from RRc) were much higher than 1 for low MAF SNPs. Derived equations showed that scenarios with a high heritability, a large number of individuals and a small number of variants have lower ratios between ASE from RRc and RRcs. We also investigated the optimal scaling parameter [from − 1 (RRcs) to 0 (RRc) in steps of 0.1] in the bovine stature dataset. We found that the log-likelihood was maximized with a scaling parameter of − 0.8, while the mean squared error of prediction was minimized with a scaling parameter of − 1, i.e., RRcs.ConclusionsLarge differences in estimated ASE were observed for low MAF SNPs when allele counts were scaled or not scaled because there is less shrinkage towards the mean for scaled allele counts. We derived a theoretical framework that shows that the difference in ASE due to shrinkage is heavily influenced by the power of the data. Increasing the power results in smaller differences in ASE whether allele counts are scaled or not.

Highlights

  • Genomic evaluation is used to predict direct genomic values (DGV) for selection candidates in breeding programs, and to estimate allele substitution effects (ASE) of single nucleotide polymorphisms (SNPs)

  • Genomic evaluation To show the impact of scaling on ASE, the effects were estimated with a random regression model (SNP-best linear unbiased prediction (BLUP)) using two different genotype coding methods: centered allele counts (RRc) and centered and scaled allele counts (RRcs)

  • Scaling in this study we presented SNP-BLUP models, equivalent models that do not explicitly estimate ASE, i.e., genomic restricted maximum likelihood estimation (GREML) or GBLUP, will result in different estimates of the ASE when the allele counts used in the construction of a genomic relationship matrix (GRM) are scaled or not

Read more

Summary

Introduction

Genomic evaluation is used to predict direct genomic values (DGV) for selection candidates in breeding programs, and to estimate allele substitution effects (ASE) of single nucleotide polymorphisms (SNPs). In addition to the DGV, allele substitution effects (ASE) are or can be computed using genomic evaluation models. An ASE represents the effect that the presence of a copy of that allele has on the phenotype This applies for the estimation of such effects in genomic evaluation. Bouwman et al Genet Sel Evol (2017) 49:79 variable selection, SNP-best linear unbiased prediction (BLUP) or ridge regression [6, 7] Other methods such as genomic (G)BLUP, genomic restricted maximum likelihood estimation (GREML), and one-step methods, use a genomic relationship matrix (GRM) that is constructed from the SNP genotypes [8,9,10]. With a GBLUP or GREML approach, it is straightforward to back-solve the ASE from DGV based on the genotypes of the animals [11]

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call