Assessing quantitative disease resistance and other complex traits can be time- and space-consuming, limiting the number of lines that can be evaluated in a breeding program. A potential improvement is the application of genomic prediction models, where genotypic data are used to calculate genomic estimated breeding values, referred to as genomic selection. Genomic prediction uses training datasets, where a germplasm panel is phenotyped and genotyped to calculate genomic estimated breeding values in a validation panel of lines based solely on genotypic data. To develop an initial phenotypic dataset, breeders may consider utilizing previously phenotyped lines in a publicly available dataset, such as plant introductions (PIs) from the USDA Soybean collection. A relevant question is, how effective is genomic prediction across diverse training and prediction panels? To answer this, we used previously collected phenotypic data for quantitative disease resistance toward Phytophthora sojae. Diverse germplasm panels were represented by sets of PIs originating from the United States, the Republic of Korea, and worldwide, for a total collection of 1,768 PIs. The accuracy of the prediction model was significantly influenced by population differentiation between panels, as well as genetic and phenotypic diversity. Low prediction accuracy resulted when the panel used to create the prediction model was highly differentiated from the validation panel. However, the addition of a small number of accessions to the training panel that were more closely related to the validation panel resulted in increased accuracy. [Formula: see text] Copyright © 2024 The Author(s). This is an open access article distributed under the CC BY-NC-ND 4.0 International license .
Read full abstract