Abstract
BackgroundThe large and diverse land plant lineage is nested within a clade of fresh water green algae, the charophytes. Collection of genome-scale data for land plants and other organisms over the past decade has invigorated the field of evolutionary biology. One of the core questions in the field asks: how did a colonization event by a green algae over 450 mya lead to one of the most successful lineages on the tree of life? This question can best be answered using the comparative method, the first step of which is to gather genome-scale data across closely related lineages to land plants. Before sequencing an entire genome it is useful to first gather transcriptome data: it is less expensive, it targets the protein coding regions of the genome, and provides support for gene models for future genome sequencing. We built Expressed Sequence Tag (EST) libraries for two charophyte species, Coleochaete orbicularis (Coleochaetales) and Spirogyra pratensis (Zygnematales). We used both Sanger sequencing and next generation 454 sequencing to cover as much of the transcriptome as possible.ResultsOur sequencing effort for Spirogyra pratensis yielded 9,984 5' Sanger reads plus 598,460 GS FLX Standard 454 sequences; Coleochaete orbicularis yielded 4,992 5' Sanger reads plus 673,811 GS FLX Titanium 454 sequences. After clustering S. pratensis yielded 12,000 unique transcripts, or unigenes, and C. orbicularis yielded 19,000. Both transcriptomes were very plant-like, i.e. most of the transcripts were more similar to streptophytes (land plants + charophyte green algae) than to other green algae in the sister group chlorophytes. BLAST results of several land plant genes hypothesized to be important in early land plant evolution resulted in high quality hits in both transcriptomes revealing putative orthologs ripe for follow-up studies.ConclusionsTwo main conclusions were drawn from this study. One illustrates the utility of next generation sequencing for transcriptome studies: larger scale data collection at a lower cost enabled us to cover a considerable portion of the transcriptome for both species. And, two, that the charophyte green algal transcriptoms are remarkably plant-like, which gives them the unique capacity to be major players for future evolutionary genomic studies addressing origin of land plant questions.
Highlights
The large and diverse land plant lineage is nested within a clade of fresh water green algae, the charophytes
S. pratensis Number of reads Average length GC content clustered by Agencourt
Algal cultures were grown and collected in various life stages as well as at different times during the day. All of these factors were part of an effort to maximize the total number of transcripts available for each pulled Expressed Sequence Tag (EST) library
Summary
The large and diverse land plant lineage is nested within a clade of fresh water green algae, the charophytes. We built Expressed Sequence Tag (EST) libraries for two charophyte species, Coleochaete orbicularis (Coleochaetales) and Spirogyra pratensis (Zygnematales). We used both Sanger sequencing and generation 454 sequencing to cover as much of the transcriptome as possible. The tremendous diversity we see in land plants today--from mosses to redwoods and orchids--all descended from a single common. Both phylogenetic and fossil evidence suggest that these orders are extremely old lineages, comparable in age to the land plants [2]. To move toward comprehensive genomic analysis of charophytes, we undertook EST analysis of two representative charophytes, Spirogyra pratensis and Coleochaete orbicularis
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have