Abstract

The newest generation of DNA sequencing technology is highlighted by the ability to generate sequence reads hundreds of kilobases in length. Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) have pioneered competitive long read platforms, with more recent work focused on improving sequencing throughput and per-base accuracy. We used whole-genome sequencing data produced by three PacBio protocols (Sequel II CLR, Sequel II HiFi, RS II) and two ONT protocols (Rapid Sequencing and Ligation Sequencing) to compare assemblies of the bacteria Escherichia coli and the fruit fly Drosophila ananassae. In both organisms tested, Sequel II assemblies had the highest consensus accuracy, even after accounting for differences in sequencing throughput. ONT and PacBio CLR had the longest reads sequenced compared to PacBio RS II and HiFi, and genome contiguity was highest when assembling these datasets. ONT Rapid Sequencing libraries had the fewest chimeric reads in addition to superior quantification of E. coli plasmids versus ligation-based libraries. The quality of assemblies can be enhanced by adopting hybrid approaches using Illumina libraries for bacterial genome assembly or polishing eukaryotic genome assemblies, and an ONT-Illumina hybrid approach would be more cost-effective for many users. Genome-wide DNA methylation could be detected using both technologies, however ONT libraries enabled the identification of a broader range of known E. coli methyltransferase recognition motifs in addition to undocumented D. ananassae motifs. The ideal choice of long read technology may depend on several factors including the question or hypothesis under examination. No single technology outperformed others in all metrics examined.

Highlights

  • Long read sequencing technologies enable the production of highly contiguous and accurate genome assemblies

  • Here, we investigate the quality of long read sequencing data produced using four methods: Pacific Biosciences (PacBio) RS II, PacBio Sequel II continuous long read sequencing (CLR), Oxford Nanopore Technologies (ONT) Rapid Sequencing Kit (ONT RAPID), and ONT Ligation Sequencing Kit (ONT LIG)

  • We demonstrate that no method was superior in all analyses performed, and the decision to use PacBio and ONT platforms for sequencing may depend on the specific question being addressed

Read more

Summary

Introduction

Long read sequencing technologies enable the production of highly contiguous and accurate genome assemblies. Since the release of the Pacific Biosciences (PacBio) RS sequencer in 2011 and the Oxford Nanopore Technologies (ONT) MinION sequencer in 2014, improvements in sequencing chemistries and new sequencing platforms have continued to produce longer sequences and higher sequencing throughput, decreasing per-base sequencing costs [1]. The PacBio Sequel II system advertises the highest throughput out of any of its sequencing platforms and includes two distinct sequencing modes: continuous long read sequencing (CLR) for ultralong reads and circular consensus sequencing (CCS/PacBio HiFi) for highly-accurate consensus reads. Long read sequencing can be used as an alternative to bisulfite sequencing to detect genome-wide DNA methylation, producing the methylome of the organism. PacBio and Oxford Nanopore Technologies (ONT) have pioneered competitive long read platforms, with more recent work focused on improving sequencing throughput and per-base accuracy. Released in 2019, the PacBio Sequel II platform advertises substantial enhancements over previous PacBio systems

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call