Indexing a sequence for mapping reads with a single mismatch

M. Sohel Rahman,Alessio Langiu,Maxime Crochemore

doi:10.1098/rsta.2013.0167

Abstract

Mapping reads against a genome sequence is an interesting and useful problem in computational molecular biology and bioinformatics. In this paper, we focus on the problem of indexing a sequence for mapping reads with a single mismatch. We first focus on a simpler problem where the length of the pattern is given beforehand during the data structure construction. This version of the problem is interesting in its own right in the context of the next generation sequencing. In the sequel, we show how to solve the more general problem. In both cases, our algorithm can construct an efficient data structure in O(n log(1+ε) n) time and space and can answer subsequent queries in O(m log log n + K) time. Here, n is the length of the sequence, m is the length of the read, 0<ε<1 and is the optimal output size.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Indexing a sequence for mapping reads with a single mismatch

Abstract

Talk to us

Similar Papers

More From: Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences

Lead the way for us

Journal: Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences	Publication Date: May 28, 2014
Citations: 45