Abstract

Wheat (Triticum aestivum L.) is a staple food for more than 35% of the world's population, with flour used to make hundreds of baked goods. Superior end-use quality is a major breeding target, however, improving it is especially time-consuming and expensive. Furthermore, genes encoding seed-storage proteins (SSPs) form multi-gene families and are repetitive, with gaps commonplace in several genome assemblies. To overcome these barriers and efficiently identify superior wheat SSP alleles, we developed ‘PanSK’ (Pan-SSP k-mer) for genotype-to-phenotype prediction based on a SSP-based pangenome resource. PanSK uses 29-mer sequences that represent each SSP gene at the pangenomic level to reveal untapped diversity across landraces and modern cultivars. Genome-wide association studies with k-mer identified 23 SSP genes associated with end-use quality representing novel targets for improvement. We evaluated the effect of rye secalin genes on end-use quality and found that removing ω-secalins from 1BL/1RS wheat translocation lines is associated with enhanced end-use quality. Finally, using machine-learning-based prediction inspired by PanSK, we predict quality phenotypes with high accuracy from genotype alone. This study provides an effective approach for genome design based on SSP genes, enabling breeding of wheat varieties with superior processing capabilities and improved end-use quality.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call