Semi-parametric estimates of population accuracy and bias of predictions of breeding values and future phenotypes using the LR method

Andres Legarra,Antonio Reverter

doi:10.1186/s12711-018-0426-6

Abstract

BackgroundCross-validation tools are used increasingly to validate and compare genetic evaluation methods but analytical properties of cross-validation methods are rarely described. There is also a lack of cross-validation tools for complex problems such as prediction of indirect effects (e.g. maternal effects) or for breeding schemes with small progeny group sizes.ResultsWe derive the expected value of several quadratic forms by comparing genetic evaluations including “partial” and “whole” data. We propose statistics that compare genetic evaluations including “partial” and “whole” data based on differences in means, covariance, and correlation, and term the use of these statistics “method LR” (from linear regression). Contrary to common belief, the regression of true on estimated breeding values is (on expectation) lower than 1 for small or related validation sets, due to family structures. For validation sets that are sufficiently large, we show that these statistics yield estimators of bias, slope or dispersion, and population accuracy for estimated breeding values. Similar results hold for prediction of future phenotypes although we show that estimates of bias, slope or dispersion using prediction of future phenotypes are sensitive to incorrect heritabilities or precorrection for fixed effects. We present an example for a set of 2111 Brahman beef cattle for which, in repeated partitioning of the data into training and validation sets, there is very good agreement of statistics of method LR with prediction of future phenotypes.ConclusionsAnalytical properties of cross-validation measures are presented. We present a new method named LR for cross-validation that is automatic, easy to use, and which yields the quantities of interest. The method compares predictions based on partial and whole data, which results in estimates of accuracy and biases. Prediction of observed records may yield biased results due to precorrection or use of incorrect heritabilities.

Highlights

Cross-validation tools are used increasingly to validate and compare genetic evaluation methods but analytical properties of cross-validation methods are rarely described
Initial applications of genomic predictions of breeding values (GEBV) in dairy cattle led to biases, with young “genomic” selected bulls with high GEBV being
The introduction of new methods for genetic or genomic evaluation raises the question of model choice and model quality

Summary

Introduction

Cross-validation tools are used increasingly to validate and compare genetic evaluation methods but analytical properties of cross-validation methods are rarely described. Models for genetic evaluation are an oversimplification of reality that usually holds only in the short run and in closely-related populations. Their properties are rarely well known, which can lead to unexpected results. Initial applications of genomic predictions of breeding values (GEBV) in dairy cattle led to biases, with young “genomic” selected bulls with high GEBV being. We need tools to rank, understand and quantify the behavior of prediction models in an “animal breeding” context. The need for these tools has dramatically increased with the implementation of genomic selection, given its built-in. Cross-validation studies are the norm [4, 9, 10]

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Genetics Selection Evolution	Publication Date: Nov 6, 2018
Citations: 156	License type: open-access

R Discovery Prime

R Discovery Prime

Semi-parametric estimates of population accuracy and bias of predictions of breeding values and future phenotypes using the LR method

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Genetics Selection Evolution

Lead the way for us

Similar Papers

120 Cross-validation of breeding values and future phenotypes for heifer pregnancy in Red Angus cattle using the LR method
Lane K Giess ... R Mark Enns
Journal of Animal Science | VOL. 102
Lane K Giess, et. al.Lane K Giess ... R Mark Enns
13 Sep 2024
Journal of Animal Science | VOL. 102

Behavior of the Linear Regression method to estimate bias and accuracies with correct and incorrect genetic evaluation models
F.L Macedo ... A Legarra
Journal of Dairy Science | VOL. 103
F.L Macedo, et. al.F.L Macedo ... A Legarra
06 Nov 2019
Journal of Dairy Science | VOL. 103

PSXII-3 Including Non-Additive Genetic Effects in Genomic Prediction and Estimation of Variance Components for Performance and Heat Stress Traits in Pigs
Leticia F Oliveira ... Luiz F F Brito
Journal of Animal Science | VOL. 101
Leticia F Oliveira, et. al.Leticia F Oliveira ... Luiz F F Brito
06 Nov 2023
Journal of Animal Science | VOL. 101

Whole genome scan for quantitative trait loci affecting body weight in chickens using a three generation design
J.B.C.H.M Van Kaam ... A Veenendaal
Livestock Production Science | VOL. 54
J.B.C.H.M Van Kaam, et. al.J.B.C.H.M Van Kaam ... A Veenendaal
01 May 1998
Livestock Production Science | VOL. 54

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Semi-parametric estimates of population accuracy and bias of predictions of breeding values and future phenotypes using the LR method

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Genetics Selection Evolution