Abstract

MotivationTechnological advances in meta-transcriptomics have enabled a deeper understanding of the structure and function of microbial communities. ‘Total RNA’ meta-transcriptomics, sequencing of total reverse transcribed RNA, provides a unique opportunity to investigate both the structure and function of active microbial communities from all three domains of life simultaneously. A major step of this approach is the reconstruction of full-length taxonomic marker genes such as the small subunit ribosomal RNA. However, current tools for this purpose are mainly targeted towards analysis of amplicon and metagenomic data and thus lack the ability to handle the massive and complex datasets typically resulting from total RNA experiments.ResultsIn this work, we introduce MetaRib, a new tool for reconstructing ribosomal gene sequences from total RNA meta-transcriptomic data. MetaRib is based on the popular rRNA assembly program EMIRGE, together with several improvements. We address the challenge posed by large complex datasets by integrating sub-assembly, dereplication and mapping in an iterative approach, with additional post-processing steps. We applied the method to both simulated and real-world datasets. Our results show that MetaRib can deal with larger datasets and recover more rRNA genes, which achieve around 60 times speedup and higher F1 score compared to EMIRGE in simulated datasets. In the real-world dataset, it shows similar trends but recovers more contigs compared with a previous analysis based on random sub-sampling, while enabling the comparison of individual contig abundances across samples for the first time.Availability and implementationThe source code of MetaRib is freely available at https://github.com/yxxue/MetaRib.Contact yaxin.xue@uib.no or Inge.Jonassen@uib.noSupplementary information Supplementary data are available at Bioinformatics online.

Highlights

  • I n t his c h a pt er, I will gi v e a bri ef i ntr o d u cti o n of t h e m ost us edtech ni q u es s u c h as m ar k er genesequen ci n g, w h ol egenomem et ageno mi cs a n d m et atr a ns cri pt o mi cs, w hi c h als olaythefound ati onand ar e hi g hl y r el e v a nt wit h m y pr oj e cts

  • M et a Ri b is b as edonthepop ul ar r R N A ass e m bl y pr o gr a m E MI R G E [ 1 5 8], t o g et h er wit h s e v er al i m pr oveme nts

  • I m pr o v e c urr e nt bi oi nf or m ati c w or kfl o w f or recov eri n g hi g h q u alit y M et agenome Ass e m bl edGenom es ( M A Gs ) wit hanewtaxonomy -b as e d r efi n e m e nt a p pr o a c h (Papers III andIV ). 5

Read more

Summary

I ntr o d u cti o n

Advanc es i n n e xt- g e n er ati onsequen ci nghavebo ost edthe st u d y of mi cr o bi al commu niti es inmanyec os yst e ms. C o nsi d eri n g t h at thecommu nit y str u ct ur e is r el ati veuneven, f or m ost n at ur al commu niti es, c o nti gs c orr es p o n di n g t o hi g hl y a b u nd a nt s p e ci es ar e m or e li k el ytobe ass e m bl edinthe first s e v er al it er ati o ns evenwhennN.

D N A e xtr a cti onRNAe xtr a cti o n
Ai m s of thethe si s
R e s ult s a n d Di s c u s si o n
C o n cl u di ngrem ar k s
B: D e n sit y of abundanceab
E MI R G E
Di s c u s si o n
F C P U426 B a si di o m y c ot a
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call