Abstract

BackgroundWith the advance of new massively parallel genotyping technologies, quantitative trait loci (QTL) fine mapping and map-based cloning become more achievable in identifying genes for important and complex traits. Development of high-density genetic markers in the QTL regions of specific mapping populations is essential for fine-mapping and map-based cloning of economically important genes. Single nucleotide polymorphisms (SNPs) are the most abundant form of genetic variation existing between any diverse genotypes that are usually used for QTL mapping studies. The massively parallel sequencing technologies (Roche GS/454, Illumina GA/Solexa, and ABI/SOLiD), have been widely applied to identify genome-wide sequence variations. However, it is still remains unclear whether sequence data at a low sequencing depth are enough to detect the variations existing in any QTL regions of interest in a crop genome, and how to prepare sequencing samples for a complex genome such as soybean. Therefore, with the aims of identifying SNP markers in a cost effective way for fine-mapping several QTL regions, and testing the validation rate of the putative SNPs predicted with Solexa short sequence reads at a low sequencing depth, we evaluated a pooled DNA fragment reduced representation library and SNP detection methods applied to short read sequences generated by Solexa high-throughput sequencing technology.ResultsA total of 39,022 putative SNPs were identified by the Illumina/Solexa sequencing system using a reduced representation DNA library of two parental lines of a mapping population. The validation rates of these putative SNPs predicted with low and high stringency were 72% and 85%, respectively. One hundred sixty four SNP markers resulted from the validation of putative SNPs and have been selectively chosen to target a known QTL, thereby increasing the marker density of the targeted region to one marker per 42 K bp.ConclusionsWe have demonstrated how to quickly identify large numbers of SNPs for fine mapping of QTL regions by applying massively parallel sequencing combined with genome complexity reduction techniques. This SNP discovery approach is more efficient for targeting multiple QTL regions in a same genetic population, which can be applied to other crops.

Highlights

  • With the advance of new massively parallel genotyping technologies, quantitative trait loci (QTL) fine mapping and map-based cloning become more achievable in identifying genes for important and complex traits

  • To identify Single nucleotide polymorphisms (SNPs) markers in a cost effective way for fine-mapping of several QTL regions and determine the false call rate of putative SNPs at a low sequencing depth, we evaluated a pooled DNA fragment reduced representation library and the SNP detection method applied to short read sequences generated by Solexa massively parallel sequencing technology

  • To verify in silico digestion results we tested the real digestions of genomic DNA of Forrest, Williams 82, and PI 342622 (G. soja) with three enzymes (CviR I, Hae III, and Rsa I) which produced the number of fragments more close to what we expected (Figure 2)

Read more

Summary

Introduction

With the advance of new massively parallel genotyping technologies, quantitative trait loci (QTL) fine mapping and map-based cloning become more achievable in identifying genes for important and complex traits. The massively parallel sequencing technologies (Roche GS/454, Illumina GA/Solexa, and ABI/ SOLiD), have been widely applied to identify genome-wide sequence variations It is still remains unclear whether sequence data at a low sequencing depth are enough to detect the variations existing in any QTL regions of interest in a crop genome, and how to prepare sequencing samples for a complex genome such as soybean. With the recent advances in massively parallel sequencing technologies and the availability of whole genome sequence, the identification of a large number of polymorphic markers by re-sequencing different genotypes in model species such as human [8] and Arabidopsis [9], is changing the landscape of genetics, which can provide molecular genetic markers and insights into gene function [10]. Large-scale identification of SNPs by massively parallel sequencing technology for crop species is still a challenging endeavor because of genome complexity and the cost for whole genome deep-coverage re-sequencing

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call