Abstract

BackgroundTomato species are of significant agricultural and ecological interest, with cultivated tomato being among the most common vegetable crops grown. Wild tomato species are native to diverse habitats in South America and show great morphological and ecological diversity that has proven useful in breeding programs. However, relatively little is known about nucleotide diversity between tomato species. Until recently limited sequence information was available for tomato, preventing genome-wide evolutionary analyses. Now, an extensive collection of tomato expressed sequence tags (ESTs) is available at the SOL Genomics Network (SGN). This database holds sequences from several species, annotated with quality values, assembled into unigenes, and tested for homology against other genomes. Despite the importance of polymorphism detection for breeding and natural variation studies, such analyses in tomato have mostly been restricted to cultivated accessions. Importantly, previous polymorphisms surveys mostly ignored the linked meta-information, limiting functional and evolutionary analyses. The current data in SGN is thus an under-exploited resource. Here we describe a cross-species analysis taking full-advantage of available information.ResultsWe mined 20,000 interspecific polymorphisms between Solanum lycopersicum and S. habrochaites or S. pennellii and 28,800 intraspecific polymorphisms within S. lycopersicum. Using the available meta-information we classified genes into functional categories and obtained estimations of single nucleotide polymorphisms (SNP) quality, position in the gene, and effect on the encoded proteins, allowing us to perform evolutionary analyses. Finally, we developed a set of more than 10,000 between-species molecular markers optimized by sequence quality and predicted intron position. Experimental validation of 491 of these molecular markers resulted in confirmation of 413 polymorphisms.ConclusionWe present a new analysis of the extensive tomato EST sequences available that represents the most comprehensive survey of sequence diversity across Solanum species to date. These SNPs, plus thousands of molecular makers designed to detect the polymorphisms are available to the community via a website. Evolutionary analyses on these polymorphism uncovered sets of genes potentially important for the evolution and domestication of tomato; interestingly these sets were enriched for genes involved in response to the environment.

Highlights

  • Tomato species are of significant agricultural and ecological interest, with cultivated tomato being among the most common vegetable crops grown

  • Expressed sequence tag (EST) assembly The unigene database version 200607 build 1 from SOL Genomics Network (SGN) consists of 239,172 ESTs grouped into 34,829 unigenes, each one containing between 1 and 1087 ESTs, with an average of 6.9 ESTs per unigene

  • We report in this work more than forty-nine thousand inter and intraspecific polymorphisms mined from the EST databases of the cultivated and two wild species of tomato

Read more

Summary

Introduction

Tomato species are of significant agricultural and ecological interest, with cultivated tomato being among the most common vegetable crops grown. Since the tomato genome sequence is currently incomplete and the microarray platforms for this species do not feature most of the loci predicted to exist [5,6,7], genetic studies have focused on the analysis of particular loci and segregating populations. These studies led to in-depth information on a few loci and uncovered the potential usefulness of the natural variation existing in related species [3,4]. A major goal for tomato geneticists is the acquisition of comprehensive genome-wide information that can be used in the improvement of resistance, quality, aspect, flavor and growth in cultivated varieties [8]

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call