Hybrid error correction and de novo assembly of single-molecule sequencing reads

Sergey Koren,Michael C Schatz,Brian P Walenz,Zhong Wang,Jeffrey Martin,Ganeshkumar Ganapathy,Jason T Howard,David A Rasko,Erich D Jarvis,Adam M Phillippy,W Richard Mccombie

doi:10.1038/nbt.2280

Abstract

Emerging single-molecule sequencing instruments can generate multi-kilobase sequences with the potential to dramatically improve genome and transcriptome assembly. However, the high error rate of single-molecule reads is challenging, and has limited their use to resequencing bacteria. To address this limitation, we introduce a novel correction algorithm and assembly strategy that utilizes shorter, high-identity sequences to correct the error in single-molecule sequences. We demonstrate the utility of this approach on Pacbio RS reads of phage, prokaryotic, and eukaryotic whole genomes, including the novel genome of the parrot Melopsittacus undulatus, as well as for RNA-seq reads of the corn (Zea mays) transcriptome. Our approach achieves over 99.9% read correction accuracy and produces substantially better assemblies than current sequencing strategies: in the best example, quintupling the median contig size relative to high-coverage, second-generation assemblies. Greater gains are predicted if read lengths continue to increase, including the prospect of single-contig bacterial chromosome assembly.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Nature Biotechnology	Publication Date: Jul 1, 2012
Citations: 962	License type: unspecified-oa

R Discovery Prime

R Discovery Prime

Hybrid error correction and de novo assembly of single-molecule sequencing reads

Abstract

Talk to us

Similar Papers

More From: Nature Biotechnology

Lead the way for us

Similar Papers

Targeted single molecule sequencing methodology for ovarian hyperstimulation syndrome.
Funda Orkunoglu-Suer ... David Frankfurter
BMC Genomics | VOL. 16
Funda Orkunoglu-Suer, et. al.Funda Orkunoglu-Suer ... David Frankfurter
03 Apr 2015
BMC Genomics | VOL. 16

MASQC: Next Generation Sequencing Assists Third Generation Sequencing for Quality Control in N6-Methyladenine DNA Identification.
Siqian Yang ... Ying Chen
Frontiers in Genetics | VOL. 11
Siqian Yang, et. al.Siqian Yang ... Ying Chen
24 Mar 2020
Frontiers in Genetics | VOL. 11

Comparative performance of transcriptome assembly methods for non-model organisms.
Xin Huang ... Xiao-Guang Chen
BMC Genomics | VOL. 17
Xin Huang, et. al.Xin Huang ... Xiao-Guang Chen
27 Jul 2016
BMC Genomics | VOL. 17

Single-Molecule Real-Time Sequencing Combined with Optical Mapping Yields Completely Finished Fungal Genome.
Luigi Faino ... Grardy C M Van Den Berg
mBio | VOL. 6
Luigi Faino, et. al.Luigi Faino ... Grardy C M Van Den Berg
18 Aug 2015
mBio | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hybrid error correction and de novo assembly of single-molecule sequencing reads

Abstract

Talk to us

Similar Papers

More From: Nature Biotechnology