Abstract

BackgroundThe genome of the human malaria parasite Plasmodium falciparum is poorly annotated, in particular, the 5′ capped ends of its mRNA transcripts. New approaches are needed to fully catalog P. falciparum transcripts for understanding gene function and regulation in this organism.MethodsWe developed a transcriptomic method based on next-generation sequencing of complementary DNA (cDNA) enriched for full-length fragments using eIF4E, a 5′ cap-binding protein, and an unenriched control. DNA sequencing adapter was added after enrichment of full-length cDNA using two different ligation protocols. From the mapped sequence reads, enrichment scores were calculated for all transcribed nucleotides and used to calculate P-values of 5′ capped nucleotide enrichment. Sensitivity and accuracy were increased by combining P-values from replicate experiments. Data were obtained for P. falciparum ring, trophozoite and schizont stages of intra-erythrocytic development.Results5′ capped nucleotide signals were mapped to 17,961 non-overlapping P. falciparum genomic intervals. Analysis of the dominant 5′ capped nucleotide in these genomic intervals revealed the presence of two groups with distinctive epigenetic features and sequence patterns. A total of 4,512 transcripts were annotated as 5′ capped based on the correspondence of 5′ end with 5′ capped nucleotide annotated from full-length cDNA data.DiscussionThe presence of two groups of 5′ capped nucleotides suggests that alternative mechanisms may exist for producing 5′ capped transcript ends in P. falciparum. The 5′ capped transcripts that are antisense, outside of, or partially overlapping coding regions may be important regulators of gene function in P. falciparum.

Highlights

  • Malaria remains the most widespread parasitic disease of humans, with over 200 million cases in 2018 (World Health Organization, 2019)

  • We present a new method for identifying 5 capped nucleotides from mRNA transcripts. mRNA is purified from the biological sample and used to synthesize complementary DNA (cDNA)

  • Full-length cDNA fragments are enriched using recombinant eIF4E protein, which is capable of binding to the 5 cap nucleotide in first-strand cDNA products (Edery et al, 1995)

Read more

Summary

Introduction

Malaria remains the most widespread parasitic disease of humans, with over 200 million cases in 2018 (World Health Organization, 2019). The 5 ends of gene transcripts in P. falciparum are modified by the addition of a canonical eukaryotic 5 cap m7G nucleotide (Ho & Shuman, 2001), which can be exploited for enriching 5 - capped mRNA (Shaw et al, 2007; Shaw et al, 2016). The genome of the human malaria parasite Plasmodium falciparum is poorly annotated, in particular, the 5 capped ends of its mRNA transcripts. We developed a transcriptomic method based on next-generation sequencing of complementary DNA (cDNA) enriched for full-length fragments using eIF4E, a 5 cap-binding protein, and an unenriched control. A total of 4,512 transcripts were annotated as 5 capped based on the correspondence of 5 end with 5 capped nucleotide annotated from fulllength cDNA data. The 5 capped transcripts that are antisense, outside of, or partially overlapping coding regions may be important regulators of gene function in P. falciparum

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call