Abstract

Recent plant breeding studies of several species have demonstrated the utility of combining molecular assessments of genetic distance into trait-linked SNP genotyping during the development of parent lines to maximize yield gains due to heterosis. SSRs (Short Sequence Repeats) are the molecular marker of choice to determine genetic diversity, but the methods historically used to sequence them have been burdensome. The ability to analyze SSRs in a higher-throughput manner independent of laboratory conditions would increase their utility in molecular ecology, germplasm curation, and plant breeding programs worldwide. This project reports simple bioinformatics methods that can be used to generate genome-wide de novo SSRs in silico followed by targeted Next Generation Sequencing (NGS) validation of those that provide the most information about sub-population identity of a breeding line, which influences heterotic group selection. While these methods were optimized in sorghum [Sorghum bicolor (L.) Moench], they were developed to be applied to any species with a reference genome and high-coverage whole-genome sequencing data of individuals from the sub-populations to be characterized. An analysis of published sorghum genomes selected to represent its five main races (bicolor, caudatum, durra, kafir, and guinea; 75 accessions total) identified 130,120 SSR motifs. Average lengths were 23.8 bp and 95% were between 10 and 92 bp, making them suitable for NGS. Validation through targeted sequencing amplified 188 of 192 assayed SSR loci. Results highlighted the distinctness of accessions from the guinea sub-group margaritiferum from all other sorghum accessions, consistent with previous studies of nuclear and mitochondrial DNA. SSRs that efficiently fingerprinted margaritiferum individuals (Xgma1 –Xgma6) are presented. Developing similar fingerprints of other sub-populations (Xunr1 –Xunr182) was not possible due to the extensive admixture between them in the data set analyzed. In summary, these methods were able to fingerprint specific sub-populations when rates of admixture between them are low.

Highlights

  • Originating in Africa, sorghum [Sorghum bicolor (L.) Moench] (2n = 2x = 20) is a drought-tolerant C4 grass species having wide genetic diversity [1]

  • There is an increasing amount of genomic sequence data available in sorghum that can be used to construct SNPbased haplotypes to differentiate between races and working groups at the genomic level, but this has yet to be leveraged efficiently in the development of molecular markers that could be used by breeders

  • In order to investigate genetic diversity in sorghum, a re-analysis was performed of whole-genome sequencing (WGS) data from 75 published accessions [1,22,25], generating a map of 19,230,634 variants at 18,299,015 sites across the S. bicolor (Sbicolor_454_v3.0.1) genome [25]

Read more

Summary

Introduction

A working group within guinea, known as margaritiferum, has been shown to be distinct from the rest of the guinea race by both mitochondrial [5] and genomic [6,7,8] DNA analyses, and contains a large amount of untapped genetic diversity [9]. This diversity notably includes many rare alleles [8], as well as desirable traits such as high nitrogen use efficiency [10] and aluminum tolerance [11]. Recent studies [16,17] have emphasized the need for better molecular markers than SNP haplotypes to more accurately and consistently assign accessions to a race or working group to distinguish among sub-populations of sorghum and assist in developing better heterotic groups

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call