Genomic prediction using preselected DNA variants from a GWAS with whole-genome sequence data in Holstein-Friesian cattle.

Roel F Veerkamp,Chris Schrooten,Aniek C Bouwman,Mario P L Calus

doi:10.1186/s12711-016-0274-1

Roel F Veerkamp, Chris Schrooten + Show 2 more

Open Access

https://doi.org/10.1186/s12711-016-0274-1

Copy DOI

Journal: Genetics Selection Evolution	Publication Date: Dec 1, 2016
Citations: 91	License type: cc-by

Affiliation: CRV (Netherlands)

Abstract

BackgroundWhole-genome sequence data is expected to capture genetic variation more completely than common genotyping panels. Our objective was to compare the proportion of variance explained and the accuracy of genomic prediction by using imputed sequence data or preselected SNPs from a genome-wide association study (GWAS) with imputed whole-genome sequence data.MethodsPhenotypes were available for 5503 Holstein–Friesian bulls. Genotypes were imputed up to whole-genome sequence (13,789,029 segregating DNA variants) by using run 4 of the 1000 bull genomes project. The program GCTA was used to perform GWAS for protein yield (PY), somatic cell score (SCS) and interval from first to last insemination (IFL). From the GWAS, subsets of variants were selected and genomic relationship matrices (GRM) were used to estimate the variance explained in 2087 validation animals and to evaluate the genomic prediction ability. Finally, two GRM were fitted together in several models to evaluate the effect of selected variants that were in competition with all the other variants.ResultsThe GRM based on full sequence data explained only marginally more genetic variation than that based on common SNP panels: for PY, SCS and IFL, genomic heritability improved from 0.81 to 0.83, 0.83 to 0.87 and 0.69 to 0.72, respectively. Sequence data also helped to identify more variants linked to quantitative trait loci and resulted in clearer GWAS peaks across the genome. The proportion of total variance explained by the selected variants combined in a GRM was considerably smaller than that explained by all variants (less than 0.31 for all traits). When selected variants were used, accuracy of genomic predictions decreased and bias increased.ConclusionsAlthough 35 to 42 variants were detected that together explained 13 to 19% of the total variance (18 to 23% of the genetic variance) when fitted alone, there was no advantage in using dense sequence information for genomic prediction in the Holstein data used in our study. Detection and selection of variants within a single breed are difficult due to long-range linkage disequilibrium. Stringent selection of variants resulted in more biased genomic predictions, although this might be due to the training population being the same dataset from which the selected variants were identified.Electronic supplementary materialThe online version of this article (doi:10.1186/s12711-016-0274-1) contains supplementary material, which is available to authorized users.

Highlights

Whole-genome sequence data is expected to capture genetic variation more completely than common genotyping panels
Several reasons may explain why the accuracy of genomic predictions does not increase when sequence data is used: (1) if the number of training individuals is small, the effects of quantitative trait loci (QTL) may be estimated with too large errors and little advantage is gained by using sequence data [10]; (2) if training is performed within a breed or line, long-range linkage disequilibrium (LD) may prevent the precise localisation of quantitative trait nucleotides (QTN) when all sequence variants are fitted simultaneously [8]; and (3) many different linear combinations of variants may occur and result in accurate genomic predictions for the same set of phenotypes
Our objective was to compare the proportion of variance explained and the accuracy of genomic prediction based on imputed sequence data, lower density single nucleotide polymorphism (SNP) panels, and preselected variants from a genome-wide association study (GWAS) based on imputed whole-genome sequence

Summary

Introduction

Whole-genome sequence data is expected to capture genetic variation more completely than common genotyping panels. Veerkamp et al Genet Sel Evol (2016) 48:95 density SNP chips [4] and that the reliability of genomic predictions and its persistency across generations and even across breeds [5, 6] will improve This was confirmed on simulated data [7], but in practice, the use of cattle and chicken sequence data has not increased the reliability of genomic predictions [8, 9]. It might be better to use fewer variants that are located closer to the QTN, than to rely on the complex LD structure between variants for the prediction of selection candidates This was found in a simulation study for across-breed prediction by Wientjes et al [11]

Objectives

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Genomic prediction using preselected DNA variants from a GWAS with whole-genome sequence data in Holstein-Friesian cattle.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Genetics Selection Evolution

Lead the way for us

Similar Papers

Genomic selection in farm animals: accuracy of prediction and applications with imputed whole-genome sequencing data in chicken
Guiyan Ni
-
Guiyan NiGuiyan Ni
21 Feb 2022
21 Feb 2022

Pre-selecting markers based on fixation index scores improved the power of genomic evaluations in a combined Yorkshire pig population
S Ye ... J Li
Animal | VOL. 14
S Ye, et. al.S Ye ... J Li
01 Jan 2020
Animal | VOL. 14

Increased accuracy of genomic predictions for growth under chronic thermal stress in rainbow trout by prioritizing variants from GWAS using imputed sequence data.
Grazyella M Yoshida ... José M Yáñez
Evolutionary applications | VOL. 15
Grazyella M Yoshida, et. al.Grazyella M Yoshida ... José M Yáñez
18 May 2021
Evolutionary applications | VOL. 15

Genomic prediction based on selected variants from imputed whole-genome sequence data in Australian sheep populations
Nasir Moghaddar ... Julius H J Van Der Werf
Genetics Selection Evolution | VOL. 51
Nasir Moghaddar, et. al.Nasir Moghaddar ... Julius H J Van Der Werf
01 Dec 2019
Genetics Selection Evolution | VOL. 51

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Genomic prediction using preselected DNA variants from a GWAS with whole-genome sequence data in Holstein-Friesian cattle.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Genetics Selection Evolution