Is single-step genomic REML with the algorithm for proven and young more computationally efficient when less generations of data are present?

Vinícius Silva Junqueira,Fernando Flores Cardoso,Paulo Sávio Lopes,Daniela Lourenco,Ignacy Misztal,Yutaka Masuda,Fabyano Fonseca E Silva

doi:10.1093/jas/skac082

Vinícius Silva Junqueira, Fernando Flores Cardoso + Show 5 more

Open Access

https://doi.org/10.1093/jas/skac082

Copy DOI

Abstract

Efficient computing techniques allow the estimation of variance components for virtually any traditional dataset. When genomic information is available, variance components can be estimated using genomic REML (GREML). If only a portion of the animals have genotypes, single-step GREML (ssGREML) is the method of choice. The genomic relationship matrix (G) used in both cases is dense, limiting computations depending on the number of genotyped animals. The algorithm for proven and young (APY) can be used to create a sparse inverse of G () with close to linear memory and computing requirements. In ssGREML, the inverse of the realized relationship matrix (H−1) also includes the inverse of the pedigree relationship matrix, which can be dense with a long pedigree, but sparser with short. The main purpose of this study was to investigate whether costs of ssGREML can be reduced using APY with truncated pedigree and phenotypes. We also investigated the impact of truncation on variance components estimation when different numbers of core animals are used in APY. Simulations included 150K animals from 10 generations, with selection. Phenotypes (h2 = 0.3) were available for all animals in generations 1–9. A total of 30K animals in generations 8 and 9, and 15K validation animals in generation 10 were genotyped for 52,890 SNP. Average information REML and ssGREML with G−1 and using 1K, 5K, 9K, and 14K core animals were compared. Variance components are impacted when the core group in APY represents the number of eigenvalues explaining a small fraction of the total variation in G. The most time-consuming operation was the inversion of G, with more than 50% of the total time. Next, numerical factorization consumed nearly 30% of the total computing time. On average, a 7% decrease in the computing time for ordering was observed by removing each generation of data. APY can be successfully applied to create the inverse of the genomic relationship matrix used in ssGREML for estimating variance components. To ensure reliable variance component estimation, it is important to use a core size that corresponds to the number of largest eigenvalues explaining around 98% of total variation in G. When APY is used, pedigrees can be truncated to increase the sparsity of H and slightly reduce computing time for ordering and symbolic factorization, with no impact on the estimates.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of animal science	Publication Date: Mar 15, 2022
Citations: 6	License type: CC BY-NC 4.0

R Discovery Prime

R Discovery Prime

Is single-step genomic REML with the algorithm for proven and young more computationally efficient when less generations of data are present?

Abstract

Talk to us

Similar Papers

More From: Journal of animal science

Lead the way for us

Similar Papers

Optimisation of the core subset for the APY approximation of genomic relationships
Ivan Pocrnic ... William O. Herring
Genetics Selection Evolution | VOL. 54
Ivan Pocrnic, et. al.Ivan Pocrnic ... William O. Herring
22 Nov 2022
Genetics Selection Evolution | VOL. 54

Solving efficiently large single-step genomic best linear unbiased prediction models.
I Strandén ... K Matilainen
Journal of Animal Breeding and Genetics | VOL. 134
I Strandén, et. al.I Strandén ... K Matilainen
15 May 2017
Journal of Animal Breeding and Genetics | VOL. 134

Core-dependent changes in genomic predictions using the Algorithm for Proven and Young in single-step genomic best linear unbiased prediction.
Ignacy Misztal ... Daniela Lourenco
Journal of animal science | VOL. 98
Ignacy Misztal, et. al.Ignacy Misztal ... Daniela Lourenco
19 Nov 2020
Journal of animal science | VOL. 98

A comprehensive study on size and definition of the core group in the proven and young algorithm for single-step GBLUP
Rostam Abdollahi-Arpanahi ... Daniela Lourenco
Genetics Selection Evolution | VOL. 54
Rostam Abdollahi-Arpanahi, et. al.Rostam Abdollahi-Arpanahi ... Daniela Lourenco
20 May 2022
Genetics Selection Evolution | VOL. 54

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Is single-step genomic REML with the algorithm for proven and young more computationally efficient when less generations of data are present?

Abstract

Talk to us

Similar Papers

More From: Journal of animal science