Refining the r-index

Hideo Bannai,Travis Gagie,Tomohiro I

doi:10.1016/j.tcs.2019.08.005

Abstract

Gagie, Navarro and Prezza's r-index (SODA, 2018) promises to speed up DNA alignment and variation calling by allowing us to index entire genomic databases, provided certain obstacles can be overcome. In this paper we first strengthen and simplify Policriti and Prezza's Toehold Lemma (DCC '16; Algorithmica, 2017), which inspired the r-index and plays an important role in its implementation. We then show how to update the r-index efficiently after adding a new genome to the database, which is likely to be vital in practice. As a by-product of this result, we obtain an online version of Policriti and Prezza's algorithm for constructing the LZ77 parse from a run-length compressed Burrows-Wheeler Transform. Our experiments demonstrate the practicality of all three of these results. Finally, we show how to augment the r-index such that, given a new genome and fast random access to the database, we can quickly compute the matching statistics and maximal exact matches of the new genome with respect to the database.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Theoretical Computer Science	Publication Date: Aug 7, 2019
Citations: 35	License type: publisher-specific-oa

R Discovery Prime

R Discovery Prime

Refining the r-index

Abstract

Talk to us

Similar Papers

More From: Theoretical Computer Science

Lead the way for us

Similar Papers

MEDAL
Wenqin Huangfu ... Peng Gu
-
Wenqin Huangfu, et. al.Wenqin Huangfu ... Peng Gu
12 Oct 2019
12 Oct 2019

A genomic distance for assembly comparison based on compressed maximal exact matches.
Sara P Garcia ... Armando J Pinho
IEEE/ACM Transactions on Computational Biology and Bioinformatics | VOL. 10
Sara P Garcia, et. al.Sara P Garcia ... Armando J Pinho
01 May 2013
IEEE/ACM Transactions on Computational Biology and Bioinformatics | VOL. 10

Finding maximal exact matches in graphs
Nicola Rizzo ... Veli Mäkinen
Algorithms for Molecular Biology | VOL. 19
Nicola Rizzo, et. al.Nicola Rizzo ... Veli Mäkinen
11 Mar 2024
Algorithms for Molecular Biology | VOL. 19

Finding Maximal Exact Matches Using the r-Index.
Massimiliano Rossi ... Ben Langmead
Journal of computational biology : a journal of computational molecular cell biology | VOL. 29
Massimiliano Rossi, et. al.Massimiliano Rossi ... Ben Langmead
17 Jan 2022
Journal of computational biology : a journal of computational molecular cell biology | VOL. 29

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Refining the r-index

Abstract

Talk to us

Similar Papers

More From: Theoretical Computer Science