Benchmarking taxonomic assignments based on 16S rRNA gene profiling of the microbiota from commonly sampled environments.

Alexandre Almeida,Aleksandra Tarkowska,Alex L Mitchell,Robert D Finn

doi:10.1093/gigascience/giy054

Alexandre Almeida, Aleksandra Tarkowska + Show 2 more

Open Access

https://doi.org/10.1093/gigascience/giy054

Copy DOI

Abstract

BackgroundTaxonomic profiling of ribosomal RNA (rRNA) sequences has been the accepted norm for inferring the composition of complex microbial ecosystems. Quantitative Insights Into Microbial Ecology (QIIME) and mothur have been the most widely used taxonomic analysis tools for this purpose, with MAPseq and QIIME 2 being two recently released alternatives. However, no independent and direct comparison between these four main tools has been performed. Here, we compared the default classifiers of MAPseq, mothur, QIIME, and QIIME 2 using synthetic simulated datasets comprised of some of the most abundant genera found in the human gut, ocean, and soil environments. We evaluate their accuracy when paired with both different reference databases and variable sub-regions of the 16S rRNA gene.FindingsWe show that QIIME 2 provided the best recall and F-scores at genus and family levels, together with the lowest distance estimates between the observed and simulated samples. However, MAPseq showed the highest precision, with miscall rates consistently <2%. Notably, QIIME 2 was the most computationally expensive tool, with CPU time and memory usage almost 2 and 30 times higher than MAPseq, respectively. Using the SILVA database generally yielded a higher recall than using Greengenes, while assignment results of different 16S rRNA variable sub-regions varied up to 40% between samples analysed with the same pipeline.ConclusionsOur results support the use of either QIIME 2 or MAPseq for optimal 16S rRNA gene profiling, and we suggest that the choice between the two should be based on the level of recall, precision, and/or computational performance required.

Highlights

Title: Benchmarking taxonomic assignments based on 16S rRNA gene profiling of the microbiota from commonly sampled environments
It would be nice to see the overall dissimilaritymetrics presented unaggregated by method and biome in a supplemental table
I agree to the open peer review policy of the journal

Summary

Reviewer Report

Title: Benchmarking taxonomic assignments based on 16S rRNA gene profiling of the microbiota from commonly sampled environments. # OverviewIn this paper the authors have sought to evaluate the performance of the 4 main packages and their default classifiers/settings used in the taxonomic profiling of rRNA sequences.They did this using synthetic simulated read sets representative of 3 commonlystudied microbiome environments and investigated the role of locus andreference database selection on classification metrics.This is well done research that will form a useful benchmark for researchers engaged in rRNA taxonomic profiling to help design and conduct their own studies.## General CommentsIt should be emphasised throughout the manuscript that as of January 1st 2018, QIIME1 is deprecated and no longer supported by the developers(https://qiime.wordpress.com/2018/01/03/qiime-2-has-succeeded-qiime-1/).QIIME1 is no longer recommended to be used at all.Secondly, it is probably worth emhpasising that QIIME1, QIIME2 and mothurare very large toolsets with many parts and functions capable of more than just taxonomic assignment. The methods section may benefit from inclusionof this database information.Figure S4: Would be nice to include a key as per Figure 1 instead of needing to crossreference to the tables

Declaration of Competing Interests

Do you have any other financial competing interests?

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: GigaScience	Publication Date: May 1, 2018
Citations: 91	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Benchmarking taxonomic assignments based on 16S rRNA gene profiling of the microbiota from commonly sampled environments.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: GigaScience

Lead the way for us

Similar Papers

Ultrafast and accurate 16S rRNA microbial community analysis using Kraken 2
Jennifer Lu ... Steven L Salzberg
Microbiome | VOL. 8
Jennifer Lu, et. al.Jennifer Lu ... Steven L Salzberg
28 Aug 2020
Microbiome | VOL. 8

Comparative Metagenomic Analysis of Human Gut Microbiome Composition Using Two Different Bioinformatic Pipelines
Valeria D’Argenio ... Giorgio Casaburi
BioMed Research International | VOL. 2014
Valeria D’Argenio, et. al.Valeria D’Argenio ... Giorgio Casaburi
01 Jan 2014
BioMed Research International | VOL. 2014

Visualizing Patterns of Marine Eukaryotic Diversity from Metabarcoding Data Using QIIME.
Matthieu Leray ... Nancy Knowlton
Methods in molecular biology (Clifton, N.J.) | VOL. 1452
Matthieu Leray, et. al.Matthieu Leray ... Nancy Knowlton
01 Jan 2015
Methods in molecular biology (Clifton, N.J.) | VOL. 1452

A Comparison of Three Bioinformatics Pipelines for the Analysis of Preterm Gut Microbiota using 16S rRNA Gene Sequencing Data
Erica Plummer ... Jimmy Twin
Journal of Proteomics & Bioinformatics | VOL. 8
Erica Plummer, et. al.Erica Plummer ... Jimmy Twin
01 Jan 2015
Journal of Proteomics & Bioinformatics | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Benchmarking taxonomic assignments based on 16S rRNA gene profiling of the microbiota from commonly sampled environments.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: GigaScience