Exact Sketch-Based Read Mapping.

Tizian Schulz,Tizian Schulz,Tizian Schulz,Paul Medvedev,Paul Medvedev,Paul Medvedev,Paul Medvedev,Paul Medvedev,Paul Medvedev,Paul Medvedev,Paul Medvedev,Paul Medvedev

doi:10.4230/lipics.wabi.2023.14

Abstract

Given a sequencing read, the broad goal of read mapping is to find the location(s) in the reference genome that have a "similar sequence". Traditionally, "similar sequence" was defined as having a high alignment score and read mappers were viewed as heuristic solutions to this well-defined problem. For sketch-based mappers, however, there has not been a problem formulation to capture what problem an exact sketch-based mapping algorithm should solve. Moreover, there is no sketch-based method that can find all possible mapping positions for a read above a certain score threshold. In this paper, we formulate the problem of read mapping at the level of sequence sketches. We give an exact dynamic programming algorithm that finds all hits above a given similarity threshold. It runs in time and space, where is the number of -mers inside the sketch of the reference, is the number of -mers inside the read's sketch and is the number of times that -mers from the pattern sketch occur in the sketch of the text. We evaluate our algorithm's performance in mapping long reads to the T2T assembly of human chromosome Y, where ampliconic regions make it desirable to find all good mapping positions. For an equivalent level of precision as minimap2, the recall of our algorithm is 0.88, compared to only 0.76 of minimap2.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Exact Sketch-Based Read Mapping.

Abstract

Talk to us

Similar Papers

More From: LIPIcs : Leibniz international proceedings in informatics

Lead the way for us

Journal: LIPIcs : Leibniz international proceedings in informatics	Publication Date: Sep 1, 2023
Citations: 1

Similar Papers

ESKEMAP: exact sketch-based read mapping
Tizian Schulz ... Paul Medvedev
Algorithms for Molecular Biology | VOL. 19
Tizian Schulz, et. al.Tizian Schulz ... Paul Medvedev
04 May 2024
Algorithms for Molecular Biology | VOL. 19

Author response: Targeted genomic sequencing with probe capture for discovery and surveillance of coronaviruses in bats
...
-
, et. al. ...
30 Sep 2022
30 Sep 2022

Predicting gene dosage using genomic sequence data
Jocelyn Elaine Barker ... James Hartman
The FASEB Journal | VOL. 22
Jocelyn Elaine Barker, et. al.Jocelyn Elaine Barker ... James Hartman
01 Mar 2008
The FASEB Journal | VOL. 22

G-SNPM - A GPU-based SNP mapping tool
Alessandro Orro ... Andrea Manconi
EMBnet.journal | VOL. 18
Alessandro Orro, et. al.Alessandro Orro ... Andrea Manconi
09 Nov 2012
EMBnet.journal | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Exact Sketch-Based Read Mapping.

Abstract

Talk to us

Similar Papers

More From: LIPIcs : Leibniz international proceedings in informatics