Abstract

BackgroundThe human pathogen Trichomonas vaginalis is a parabasalian flagellate that is estimated to infect 3% of the world’s population annually. With a 160 megabase genome and up to 60,000 genes residing in six chromosomes, the parasite has the largest genome among sequenced protists. Although it is thought that the genome size and unusual large coding capacity is owed to genome duplication events, the exact reason and its consequences are less well studied.ResultsAmong transcriptome data we found thousands of instances, in which reads mapped onto genomic loci not annotated as genes, some reaching up to several kilobases in length. At first sight these appear to represent long non-coding RNAs (lncRNAs), however, about half of these lncRNAs have significant sequence similarities to genomic loci annotated as protein-coding genes. This provides evidence for the transcription of hundreds of pseudogenes in the parasite. Conventional lncRNAs and pseudogenes are expressed in Trichomonas through their own transcription start sites and independently from flanking genes in Trichomonas. Expression of several representative lncRNAs was verified through reverse-transcriptase PCR in different T. vaginalis strains and case studies exclude the use of alternative start codons or stop codon suppression for the genes analysed.ConclusionOur results demonstrate that T. vaginalis expresses thousands of intergenic loci, including numerous transcribed pseudogenes. In contrast to yeast these are expressed independently from neighbouring genes. Our results furthermore illustrate the effect genome duplication events can have on the transcriptome of a protist. The parasite’s genome is in a steady state of changing and we hypothesize that the numerous lncRNAs could offer a large pool for potential innovation from which novel proteins or regulatory RNA units could evolve.Electronic supplementary materialThe online version of this article (doi:10.1186/1471-2164-15-906) contains supplementary material, which is available to authorized users.

Highlights

  • The human pathogen Trichomonas vaginalis is a parabasalian flagellate that is estimated to infect 3% of the world’s population annually

  • Genes of the Argonaute (AGO) and Dicer-like family are encoded by Trichomonas and suggest the existence of functional RNA interference mechanisms [5,14], other studies question the functionality of identified miRNAs in this parasite [19]

  • No analysis dedicated to the characterization of long non-coding RNAs (lncRNAs) or pseudogene expression in protists apart from yeast [27,35] is currently available

Read more

Summary

Introduction

The human pathogen Trichomonas vaginalis is a parabasalian flagellate that is estimated to infect 3% of the world’s population annually. The parabasalian flagellate Trichomonas vaginalis is a unique human parasite causing trichomoniasis, the most common sexually transmitted disease (STD) [1]. It has been suggested that the frequency of pseudogenes in T. vaginalis is at least 5% and that unstable gene families that underwent many gene duplication events, thereby producing pseudogenes on the way, further contributed to the large genome of T. vaginalis [11]. Genes of the Argonaute (AGO) and Dicer-like family are encoded by Trichomonas and suggest the existence of functional RNA interference mechanisms [5,14], other studies question the functionality of identified miRNAs in this parasite [19]. Recent deep-sequencing of the parasite’s transcriptome has shed light on the expression potential of the genome and provided evidence for the expression of about 30,000 genes and a correlated co-expression of gene families induced by different stimuli [10,28]

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.