Abstract

Amplicon sequencing of the 16S rRNA gene is commonly used for the identification of bacterial isolates in diagnostic laboratories and mostly relies on the Sanger sequencing method. The latter, however, suffers from a number of limitations, with the most significant being the inability to resolve mixed amplicons when closely related species are coamplified from a mixed culture. This often leads to either increased turnaround time or absence of usable sequence data. Short-read next-generation sequencing (NGS) technologies could solve the mixed amplicon issue but would lack both cost efficiency at low throughput and fast turnaround times. Nanopore sequencing developed by Oxford Nanopore Technologies (ONT) could solve those issues by enabling a flexible number of samples per run and an adjustable sequencing time. Here, we report on the development of a standardized laboratory workflow combined with a fully automated analysis pipeline LORCAN (long read consensus analysis), which together provide a sample-to-report solution for amplicon sequencing and taxonomic identification of the resulting consensus sequences. Validation of the approach was conducted on a panel of reference strains and on clinical samples consisting of single or mixed rRNA amplicons associated with various bacterial genera by direct comparison to the corresponding Sanger sequences. Additionally, simulated read and amplicon mixtures were used to assess LORCAN's behavior when dealing with samples with known cross-contamination levels. We demonstrate that by combining ONT amplicon sequencing results with LORCAN, the accuracy of Sanger sequencing can be closely matched (>99.6% sequence identity) and that mixed samples can be resolved at the single-base resolution level. The presented approach has the potential to significantly improve the flexibility, reliability, and availability of amplicon sequencing in diagnostic settings.

Highlights

  • The sequencing of the 16S rRNA gene is essential to describe the diversity of the human microbiome (1, 2)

  • We present a standardized laboratory workflow accompanied by a fully automated analysis pipeline, which together provide a sample-to-report solution for taxonomic identification of bacterial cultures based on amplicon sequencing of their 16S rRNA genes (Fig. 1)

  • Validation of the sequencing results was conducted by direct comparison to Sanger sequencing with real clinical samples consisting of pure or mixed rRNA amplicons belonging to several bacterial genera (Bacteroides, Eggerthella, Enterococcus, Klebsiella, Mycobacterium, Campylobacter, Pseudomonas) of expected amplicon sizes of 500 bp

Read more

Summary

Introduction

The sequencing of the 16S rRNA gene is essential to describe the diversity of the human microbiome (1, 2). The third-generation single-molecule sequencing technology provided by Oxford Nanopore Technologies (ONT) might offer the necessary flexibility in throughput and is capable of producing reads with lengths of several hundred to several hundred thousand bases at competitive costs (9). Available workflows are often limited to the analysis of pure amplicons (10–13), include complex modifications of the ONT laboratory workflows (15, 16), or lack published validation by using samples other than mock communities (17, 18). We developed a complete workflow based on standard ONT protocols and a fully automated analysis pipeline LORCAN capable of producing high-quality consensus sequences and thorough taxonomic analysis from pure and low-complexity cultures. We explored the robustness of LORCAN’s consensus generation and species identification by analyzing artificial mixtures of amplicons and reads at different levels of genetic distances

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call