Abstract
Euglenids are a group of protists that comprises species with diverse feeding modes. One distinct and diversified clade of euglenids is photoautotrophic, and its members bear green secondary plastids. In this paper we present the plastid genome of the euglenid Eutreptiella, which we assembled from 454 sequencing of Eutreptiella gDNA. Comparison of this genome and the only other available plastid genomes of photosynthetic euglenid, Euglena gracilis, revealed that they contain a virtually identical set of 57 protein coding genes, 24 genes fewer than the genome of Pyramimonas parkeae, the closest extant algal relative of the euglenid plastid. Searching within the transcriptomes of Euglena and Eutreptiella showed that 6 of the missing genes were transferred to the nucleus of the euglenid host while 18 have been probably lost completely. Euglena and Eutreptiella represent the deepest bifurcation in the photosynthetic clade, and therefore all these gene transfers and losses must have happened before the last common ancestor of all known photosynthetic euglenids. After the split of Euglena and Eutreptiella only one additional gene loss took place. The conservation of gene content in the two lineages of euglenids is in contrast to the variability of gene order and intron counts, which diversified dramatically. Our results show that the early secondary plastid of euglenids was much more susceptible to gene losses and endosymbiotic gene transfers than the established plastid, which is surprisingly resistant to changes in gene content.
Highlights
Euglenids are a relatively large group of protists that contains species with different types of feeding strategies: some euglenid species (e.g. Rhabdomonas) are osmotrophic and feed by pinocytosis; others developed phagotrophic apparatuses for catching bacteria (e.g. Entosiphon) or even eukaryotes (e.g. Peranema) [1,2]
The genome sequence is numbered from the first nucleotide after the second 23S RNA gene
The organization of the genome resembles those of higher plants and algae with a large single copy region (LSC), a small single copy region (SSC) and two inverted repeats (IR)
Summary
Euglenids are a relatively large group of protists that contains species with different types of feeding strategies: some euglenid species (e.g. Rhabdomonas) are osmotrophic and feed by pinocytosis; others developed phagotrophic apparatuses for catching bacteria (e.g. Entosiphon) or even eukaryotes (e.g. Peranema) [1,2]. Complete plastid genome sequences are known so far for only two closely related euglenid species, Euglena gracilis [6] and Euglena longa [7]. There are indications that the expansion of introns may be a feature specific to E. gracilis and its relatives [17,18]; no other plastid genome of euglenids has been completely sequenced, which would be necessary to enable comprehensive comparisons.
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have