Abstract

BackgroundWith the availability of next-generation sequencing technologies, genomic prediction based on whole-genome sequencing (WGS) data is now feasible in animal breeding schemes and was expected to lead to higher predictive ability, since such data may contain all genomic variants including causal mutations. Our objective was to compare prediction ability with high-density (HD) array data and WGS data in a commercial brown layer line with genomic best linear unbiased prediction (GBLUP) models using various approaches to weight single nucleotide polymorphisms (SNPs).MethodsA total of 892 chickens from a commercial brown layer line were genotyped with 336 K segregating SNPs (array data) that included 157 K genic SNPs (i.e. SNPs in or around a gene). For these individuals, genome-wide sequence information was imputed based on data from re-sequencing runs of 25 individuals, leading to 5.2 million (M) imputed SNPs (WGS data), including 2.6 M genic SNPs. De-regressed proofs (DRP) for eggshell strength, feed intake and laying rate were used as quasi-phenotypic data in genomic prediction analyses. Four weighting factors for building a trait-specific genomic relationship matrix were investigated: identical weights, −(log10P) from genome-wide association study results, squares of SNP effects from random regression BLUP, and variable selection based weights (known as BLUP|GA). Predictive ability was measured as the correlation between DRP and direct genomic breeding values in five replications of a fivefold cross-validation.ResultsAveraged over the three traits, the highest predictive ability (0.366 ± 0.075) was obtained when only genic SNPs from WGS data were used. Predictive abilities with genic SNPs and all SNPs from HD array data were 0.361 ± 0.072 and 0.353 ± 0.074, respectively. Prediction with −(log10P) or squares of SNP effects as weighting factors for building a genomic relationship matrix or BLUP|GA did not increase accuracy, compared to that with identical weights, regardless of the SNP set used.ConclusionsOur results show that little or no benefit was gained when using all imputed WGS data to perform genomic prediction compared to using HD array data regardless of the weighting factors tested. However, using only genic SNPs from WGS data had a positive effect on prediction ability.

Highlights

  • With the availability of next-generation sequencing technologies, genomic prediction based on whole-genome sequencing (WGS) data is feasible in animal breeding schemes and was expected to lead to higher predictive ability, since such data may contain all genomic variants including causal mutations

  • Brøndum et al [11] showed that the reliability of Genomic prediction (GP) could be improved by adding several significant quantitative trait loci (QTL), which were detected by genome-wide association studies (GWAS) of whole-genome sequence (WGS) data, to the regular 54 K bovine array data, especially for production traits

  • The fact that we found high rates of low Rsq values within the set of single nucleotide polymorphisms (SNPs) with a low minor allele frequency (MAF) could be due to low linkage disequilibrium (LD) between these SNPs and adjacent SNPs, which can result in lower imputation accuracy [for imputation accuracies in different MAF bins] [37,38,39,40,41]

Read more

Summary

Introduction

With the availability of next-generation sequencing technologies, genomic prediction based on whole-genome sequencing (WGS) data is feasible in animal breeding schemes and was expected to lead to higher predictive ability, since such data may contain all genomic variants including causal mutations. Our objective was to compare prediction ability with high-density (HD) array data and WGS data in a commercial brown layer line with genomic best linear unbiased prediction (GBLUP) models using various approaches to weight single nucleotide polymorphisms (SNPs). In a first study using sequenced inbred lines of Drosophila melanogaster, prediction based on WGS data using ~2.5 million (M) SNPs did not increase accuracy compared to an approach using only ~5% of the segregating SNPs [8]. GP with WGS data could be attractive, so far the expectations for higher accuracies have not been realized with real data on cattle

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call