Abstract

Although yield trials for switchgrass (Panicum virgatum L.), a potentially high value biofuel feedstock crop, are currently underway throughout North America, the genetic tools for crop improvement in this species are still in the early stages of development. Identification of high-density molecular markers, such as single nucleotide polymorphisms (SNPs), that are amenable to high-throughput genotyping approaches, is the first step in a quantitative genetics study of this model biofuel crop species. We generated and sequenced expressed sequence tag (EST) libraries from thirteen diverse switchgrass cultivars representing both upland and lowland ecotypes, as well as tetraploid and octoploid genomes. We followed this with reduced genomic library preparation and massively parallel sequencing of the same samples using the Illumina Genome Analyzer technology platform. EST libraries were used to generate unigene clusters and establish a gene-space reference sequence, thus providing a framework for assembly of the short sequence reads. SNPs were identified utilizing these scaffolds. We used a custom software program for alignment and SNP detection and identified over 149,000 SNPs across the 13 short-read sequencing libraries (SRSLs). Approximately 25,000 additional SNPs were identified from the entire EST collection available for the species. This sequencing effort generated data that are suitable for marker development and for estimation of population genetic parameters, such as nucleotide diversity and linkage disequilibrium. Based on these data, we assessed the feasibility of genome wide association mapping and genomic selection applications in switchgrass. Overall, the SNP markers discovered in this study will help facilitate quantitative genetics experiments and greatly enhance breeding efforts that target improvement of key biofuel traits and development of new switchgrass cultivars.

Highlights

  • Switchgrass (Panicum virgatumL.) is a perennial C4 warm-season grass native to North America, where it occurs naturally from 55u N latitude to deep into Mexico, mostly as a dominant species of the tall grass prairies

  • It is possible to anchor some portion of short-read sequences to a related taxon for which a complete genome sequence is available, this is undesirable for single nucleotide polymorphisms (SNPs) discovery

  • SNP discovery algorithms often prevent SNP calling against the reference sequence when no reads representing the reference allele are present in the assembly

Read more

Summary

Introduction

Switchgrass (Panicum virgatumL.) is a perennial C4 warm-season grass native to North America, where it occurs naturally from 55u N latitude to deep into Mexico, mostly as a dominant species of the tall grass prairies. Two genetically and phenotypically distinct switchgrass ecotypes, lowland and upland, were identified in early genetic screening studies. They are distinguished by a number of morphological traits and their natural habitat. The lowland ecotype has a taller, coarser, upright phenotype, with a more rapid growth habit compared to the upland ecotype, and is generally found in wetter habitats, such as floodplains. With respect to genetic distinguishing features such as ploidy levels, lowland switchgrass ecotypes are mostly tetraploid (2n = 4x = 36), whereas upland switchgrass ecotypes are much more complex in their ploidy levels, and generally display higher orders of ploidy. The level of natural gene flow between the ecotypes is unknown. Most of the recent research and breeding has focused on the lowland ecotype, with its stable, simpler genome and its high yield potential, in warmer parts of the US, our project targeted northern-adapted, upland germplasm

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call