Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle.

Rianne Van Binsbergen, Chris Schrooten, Fred A Van Eeuwijk, Marco C A M Bink, Mario P L Calus, Roel F Veerkamp

doi:10.1186/s12711-015-0149-x

Open Access

https://doi.org/10.1186/s12711-015-0149-x

Abstract

Background: In contrast to currently used single nucleotide polymorphism (SNP) panels, the use of whole-genome sequence data is expected to enable the direct estimation of the effects of causal mutations on a given trait. This could lead to higher reliabilities of genomic predictions compared to those based on SNP genotypes. Also, at each generation of selection, recombination events between a SNP and a mutation can cause decay in reliability of genomic predictions based on markers rather than on the causal variants. Our objective was to investigate the use of imputed whole-genome sequence genotypes versus high-density SNP genotypes on (the persistency of) the reliability of genomic predictions using real cattle data.

Methods: Highly accurate phenotypes based on daughter performance and Illumina BovineHD Beadchip genotypes were available for 5503 Holstein Friesian bulls. The BovineHD genotypes (631,428 SNPs) of each bull were used to impute whole-genome sequence genotypes (12,590,056 SNPs) using the Beagle software. Imputation was done using a multi-breed reference panel of 429 sequenced individuals. Genomic estimated breeding values for three traits were predicted using a Bayesian stochastic search variable selection (BSSVS) model and a genome-enabled best linear unbiased prediction model (GBLUP). Reliabilities of predictions were based on 2087 validation bulls, while the other 3416 bulls were used for training.

Results: Prediction reliabilities ranged from 0.37 to 0.52. BSSVS performed better than GBLUP in all cases. Reliabilities of genomic predictions were slightly lower with imputed sequence data than with BovineHD chip data. Also, the reliabilities tended to be lower for both sequence data and BovineHD chip data when relationships between training animals were low. No increase in persistency of prediction reliability using imputed sequence data was observed.

Conclusions: Compared to BovineHD genotype data, using imputed sequence data for genomic prediction produced no advantage. To investigate the putative advantage of genomic prediction using (imputed) sequence data, a training set with a larger number of individuals that are distantly related to each other and genomic prediction models that incorporate biological information on the SNPs or that apply stricter SNP pre-selection should be considered.

Electronic supplementary material: The online version of this article (doi:10.1186/s12711-015-0149-x) contains supplementary material, which is available to authorized users.
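The training and validation design described in the Methods can be illustrated with a minimal GBLUP sketch. This is not the authors' implementation: the genotypes and phenotypes are simulated, the heritability is fixed rather than estimated, the genomic relationship matrix uses one common construction (VanRaden's first method), and the squared correlation between predictions and validation phenotypes is used as a simple stand-in for reliability. All function and variable names are hypothetical.

```python
import numpy as np

def vanraden_g(genotypes):
    """Genomic relationship matrix from 0/1/2 genotypes (animals x SNPs)."""
    p = genotypes.mean(axis=0) / 2.0            # allele frequency per SNP
    z = genotypes - 2.0 * p                     # centre each SNP column
    return z @ z.T / (2.0 * np.sum(p * (1.0 - p)))

def gblup_predict(g, y_train, train_idx, valid_idx, h2):
    """Predict validation animals from training phenotypes (h2 assumed known)."""
    lam = (1.0 - h2) / h2                       # residual / genetic variance
    g_tt = g[np.ix_(train_idx, train_idx)]      # training x training block
    g_vt = g[np.ix_(valid_idx, train_idx)]      # validation x training block
    mu = y_train.mean()
    # Solve (G_tt + lambda * I) alpha = y - mu, then project onto validation.
    alpha = np.linalg.solve(g_tt + lam * np.eye(len(train_idx)), y_train - mu)
    return mu + g_vt @ alpha

def squared_correlation(pred, obs):
    """Squared Pearson correlation, used here as a proxy for reliability."""
    return np.corrcoef(pred, obs)[0, 1] ** 2

# Simulated data (assumption: sizes and effects are arbitrary toy values).
rng = np.random.default_rng(1)
n_animals, n_snps, h2 = 300, 500, 0.5
freqs = rng.uniform(0.1, 0.9, n_snps)
genotypes = rng.binomial(2, freqs, size=(n_animals, n_snps)).astype(float)
true_bv = genotypes @ rng.normal(0.0, 1.0, n_snps)
true_bv = (true_bv - true_bv.mean()) / true_bv.std()
y = np.sqrt(h2) * true_bv + rng.normal(0.0, np.sqrt(1.0 - h2), n_animals)

train_idx = np.arange(0, 200)                   # training animals
valid_idx = np.arange(200, 300)                 # validation animals
g = vanraden_g(genotypes)
gebv = gblup_predict(g, y[train_idx], train_idx, valid_idx, h2)
reliability = squared_correlation(gebv, y[valid_idx])
print(f"validation squared correlation: {reliability:.3f}")
```

In the study itself, the same split logic applies at a larger scale (3416 training and 2087 validation bulls), and the comparison is repeated for BovineHD and imputed sequence genotypes.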

Highlights

Genomic selection is increasingly applied in breeding programs for livestock and plant species, e.g. [1,2,3,4].
Inclusion of the causal mutations allows the effect of the quantitative trait loci (QTL) on a given trait to be estimated directly, which should increase the reliability of genomic predictions compared to using single nucleotide polymorphism (SNP) genotypes, as well as the persistency of the reliability of predictions across generations and even across breeds [11,12,13].
Descriptive results: After editing SNPs for minor allele frequency (MAF) and imputation reliability, the final BovineHD and ImputedHD genotype dataset consisted of 631,428 SNPs and the imputed sequence genotype dataset of 12,590,056 SNPs.
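The SNP editing step mentioned in the last highlight can be sketched as a simple filter. The thresholds below (`min_maf`, `min_r2`) are hypothetical placeholders, since this page does not state the values the authors used, and `imputation_r2` stands for whatever per-SNP imputation reliability score is available.

```python
import numpy as np

def filter_snps(genotypes, imputation_r2, min_maf=0.01, min_r2=0.3):
    """Keep SNPs that pass both a MAF and an imputation-reliability threshold.

    genotypes: animals x SNPs array of 0/1/2 allele counts.
    imputation_r2: per-SNP imputation reliability score (assumed in [0, 1]).
    The default thresholds are illustrative assumptions, not the paper's values.
    """
    freq = genotypes.mean(axis=0) / 2.0         # frequency of the counted allele
    maf = np.minimum(freq, 1.0 - freq)          # minor allele frequency
    keep = (maf >= min_maf) & (imputation_r2 >= min_r2)
    return genotypes[:, keep], keep

# Toy example: 4 animals, 4 SNPs.
genotypes = np.array([
    [0, 1, 2, 0],
    [0, 1, 2, 1],
    [0, 2, 2, 1],
    [0, 0, 2, 2],
], dtype=float)
imputation_r2 = np.array([0.9, 0.8, 0.95, 0.1])

filtered, keep = filter_snps(genotypes, imputation_r2)
print(keep)            # which SNP columns survive the filter
print(filtered.shape)  # animals x retained SNPs
```

Here the first and third SNPs are monomorphic and fail the MAF filter, and the fourth fails the reliability filter, so only the second SNP is retained.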

Summary

Introduction

Genomic selection is increasingly applied in breeding programs for livestock and plant species, e.g. [1,2,3,4]. Inclusion of the causal mutations allows the effect of the quantitative trait loci (QTL) on a given trait to be estimated directly, which should increase the reliability of genomic predictions compared to using single nucleotide polymorphism (SNP) genotypes, as well as the persistency of the reliability of predictions across generations and even across breeds [11,12,13]. In contrast to currently used SNP panels, the use of whole-genome sequence data is expected to enable the direct estimation of the effects of causal mutations on a given trait. This could lead to higher reliabilities of genomic predictions compared to those based on SNP genotypes. Our objective was to investigate the use of imputed whole-genome sequence genotypes versus high-density SNP genotypes on (the persistency of) the reliability of genomic predictions using real cattle data.
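The persistency argument rests on recombination gradually breaking the association between a marker and a nearby causal mutation. A minimal sketch of the textbook expectation, assuming random mating with no selection or drift, is that the linkage disequilibrium coefficient D shrinks by a factor (1 - c) per generation, where c is the recombination rate between the two loci. This is an illustration of the general principle only, not an analysis from the paper, and the starting value and rates are arbitrary.

```python
def ld_after_generations(d0, recomb_rate, generations):
    """Expected LD coefficient D after a number of generations.

    Assumes random mating and no selection or drift: D_t = D_0 * (1 - c)^t.
    """
    return d0 * (1.0 - recomb_rate) ** generations

d0 = 0.2  # arbitrary starting disequilibrium between a marker and a mutation
for c in (0.0, 0.01, 0.1):
    trajectory = [round(ld_after_generations(d0, c, t), 4) for t in (0, 1, 5, 10)]
    print(f"c = {c}: {trajectory}")
```

With c = 0, which corresponds to genotyping the causal mutation itself, the association does not decay; this is the intuition behind the expected persistency advantage of sequence data.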

Journal: Genetics Selection Evolution	Publication Date: Sep 17, 2015
Citations: 152	License type: cc-by

Similar Papers

Genomic prediction based on selected variants from imputed whole-genome sequence data in Australian sheep populations
Nasir Moghaddar ... Julius H J Van Der Werf
Genetics Selection Evolution | VOL. 51
01 Dec 2019

Genomic selection in farm animals: accuracy of prediction and applications with imputed whole-genome sequencing data in chicken
Guiyan Ni
21 Feb 2022

Genome-wide association study and genomic prediction for intramuscular fat content in Suhuai pigs using imputed whole-genome sequencing data.
Binbin Wang ... Qiang Li
Evolutionary applications | VOL. 15
24 Oct 2022

Strategies for Obtaining and Pruning Imputed Whole-Genome Sequence Data for Genomic Prediction
Shaopan Ye ... Xiaolong Yuan
Frontiers in Genetics | VOL. 10
17 Jul 2019
