Abstract

BackgroundIllumina paired-end sequencing has been increasingly popular for 16S rRNA gene-based microbiota profiling. It provides higher phylogenetic resolution than single-end reads due to a longer read length. However, the reverse read (R2) often has significant low base quality, and a large proportion of R2s will be discarded after quality control, resulting in a mixture of paired-end and single-end reads. A typical 16S analysis pipeline usually processes either paired-end or single-end reads but not a mixture. Thus, the quantification accuracy and statistical power will be reduced due to the loss of a large amount of reads. As a result, rare taxa may not be detectable with the paired-end approach, or low taxonomic resolution will result in a single-end approach.ResultsTo have both the higher phylogenetic resolution provided by paired-end reads and the higher sequence coverage by single-end reads, we propose a novel OTU-picking pipeline, hybrid-denovo, that can process a hybrid of single-end and paired-end reads. Using high-quality paired-end reads as a gold standard, we show that hybrid-denovo achieved the highest correlation with the gold standard and performed better than the approaches based on paired-end or single-end reads in terms of quantifying the microbial diversity and taxonomic abundances. By applying our method to a rheumatoid arthritis (RA) data set, we demonstrated that hybrid-denovo captured more microbial diversity and identified more RA-associated taxa than a paired-end or single-end approach.Conclusions Hybrid-denovo utilizes both paired-end and single-end 16S sequencing reads and is recommended for 16S rRNA gene targeted paired-end sequencing data.

Highlights

  • The microbiome plays an important role in global ecology, nutrient cycling, and disease [1]

  • To integrate information from both paired-end and singleend reads, we propose hybrid-denovo, a pipeline that combines paired-end and single-end reads in order to retain the advantages of de novo operational taxonomic unit (OTU)-picking while maximizing the ability to detect rare taxa

  • We proposed hybrid-denovo for de novo OTU-picking based on paired-end 16S sequence tags

Read more

Summary

Introduction

The microbiome plays an important role in global ecology, nutrient cycling, and disease [1]. Identifying related groups of organisms known as operational taxonomic units (OTUs) remains a central part of the analysis of microbiome data. Both de novo and reference-based approaches have been proposed for Received: 23 October 2017; Revised: 28 November 2017; Accepted: 7 December 2017. Illumina paired-end sequencing has been increasingly popular for 16S rRNA gene-based microbiota profiling. It provides higher phylogenetic resolution than single-end reads due to a longer read length. Conclusions: Hybrid-denovo utilizes both paired-end and single-end 16S sequencing reads and is recommended for 16S rRNA gene targeted paired-end sequencing data

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call