NaiveBayesCall: An Efficient Model-Based Base-Calling Algorithm for High-Throughput Sequencing

Wei-Chun Kao,Yun S Song

doi:10.1089/cmb.2010.0247

Abstract

Immense amounts of raw instrument data (i.e., images of fluorescence) are currently being generated using ultra high-throughput sequencing platforms. An important computational challenge associated with this rapid advancement is to develop efficient algorithms that can extract accurate sequence information from raw data. To address this challenge, we recently introduced a novel model-based base-calling algorithm that is fully parametric and has several advantages over previously proposed methods. Our original algorithm, called BayesCall, significantly reduced the error rate, particularly in the later cycles of a sequencing run, and also produced useful base-specific quality scores with a high discrimination ability. Unfortunately, however, BayesCall is too computationally expensive to be of broad practical use. In this article, we build on our previous model-based approach to devise an efficient base-calling algorithm that is orders of magnitude faster than BayesCall, while still maintaining a comparably high level of accuracy. Our new algorithm is called naive-BayesCall, and it utilizes approximation and optimization methods to achieve scalability. We describe the performance of naiveBayesCall and demonstrate how improved base-calling accuracy may facilitate de novo assembly and SNP detection when the sequence coverage depth is low to moderate.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

NaiveBayesCall: An Efficient Model-Based Base-Calling Algorithm for High-Throughput Sequencing

Abstract

Talk to us

Similar Papers

More From: Journal of computational biology : a journal of computational molecular cell biology

Lead the way for us

Journal: Journal of computational biology : a journal of computational molecular cell biology	Publication Date: Mar 1, 2011
Citations: 29

Similar Papers

NaiveBayesCall: An Efficient Model-Based Base-Calling Algorithm for High-Throughput Sequencing
Wei-Chun Kao ... Yun S Song
-
Wei-Chun Kao, et. al.Wei-Chun Kao ... Yun S Song
01 Jan 2009
01 Jan 2009

Direct-to-consumer raw genetic data and third-party interpretation services: more burden than bargain?
Tia Moscarello ... Erin Demo
Genetics in Medicine | VOL. 21
Tia Moscarello, et. al.Tia Moscarello ... Erin Demo
01 Mar 2019
Genetics in Medicine | VOL. 21

BayesCall: A model-based base-calling algorithm for high-throughput short-read sequencing
Wei-Chun Kao ... Yun S Song
Genome research | VOL. 19
Wei-Chun Kao, et. al.Wei-Chun Kao ... Yun S Song
06 Aug 2009
Genome research | VOL. 19

Genome-wide SNP calling using next generation sequencing data in tomato.
Ji-Eun Kim ... Jeong-Hee Lee
Molecules and cells | VOL. 37
Ji-Eun Kim, et. al.Ji-Eun Kim ... Jeong-Hee Lee
01 Jan 2014
Molecules and cells | VOL. 37

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

NaiveBayesCall: An Efficient Model-Based Base-Calling Algorithm for High-Throughput Sequencing

Abstract

Talk to us

Similar Papers

More From: Journal of computational biology : a journal of computational molecular cell biology