Abstract
Recent advances in next generation sequencing technologies make genotyping by sequencing (GBS) more feasible for molecular characterization of plant germplasm with complex and unsequenced genomes. We used a GBS protocol consisting of Roche 454 pyrosequencing, genomic reduction and advanced bioinformatics tools to analyze genetic diversity of 24 diverse yellow mustard accessions. One and one half 454 pyrosequencing runs generated roughly 1.2 million sequence reads totaling about 392 million nucleotides. Application of the computational pipeline DIAL identified 512 contigs and 828 SNPs. The BLAST algorithm revealed alignments of 214 contigs with the sequences reported in NCBI nr/nt database. Sanger sequencing confirmed 95 % of 41 selected contigs and 94 % of 240 putative SNPs. The 454 scored SNPs were highly imbalanced among assayed samples. Diversity analysis of these SNPs revealed that 26.1 % of the total variation resided among landrace, cultivar and breeding lines and 24.7 % between yellow- and black-seeded germplasm. Cluster analysis showed that the black-seeded accessions were largely clustered together and the breeding lines were grouped with known origin. Computer simulation was performed to assess the impact of 454 SNPs missing and revealed considerable changes in allelic count, bias in detection of genetic structure, and large deviations from the expected genetic-distance matrix. These findings are useful for parental selection consideration in yellow mustard breeding, and our detailed analyses help illustrate the utility of GBS in genetic-diversity analysis of plant germplasm, particularly for genetic-relationship assessment.
Highlights
Cluster analysis showed that the black-seeded accessions were largely clustered together and the breeding lines were grouped with known origin
This study represented the first attempt to characterize yellow mustard germplasm based on a genotyping by sequencing (GBS) protocol
Our GBS application generated a novel set of genomic resources useful for further genomic analysis of yellow mustard, and provided a large genotypic data set for genetic-diversity analyses of the assayed samples
Summary
The GBS approach starts by reducing genome complexity with restriction enzymes, barcoding enzyme-cut genomic DNAs with indexed adaptors, multiplex-sequencing the barcoded DNA fragments in high-throughput NGS platforms, followed by a bioinformatics analysis of indexed sequence reads to identify genetic variants and a genetic diversity analysis of assayed samples. This approach requires no prior sequencing of plant genome, provides direct genotyping of plants with complex genomes without prior SNP discovery, and is capable of producing high-density, low-cost genotype information. Very few studies have been conducted to assess the effectiveness and informativeness of GBS for genomic characterization of plant germplasm (Fu and Peterson 2012)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.