Long-read mapping to repetitive reference sequences using Winnowmap2.

Chirag Jain,Arang Rhie,Nancy F Hansen,Sergey Koren,Adam M Phillippy

doi:10.1038/s41592-022-01457-8

Abstract

Approximately 5-10% of the human genome remains inaccessible due to the presence of repetitive sequences such as segmental duplications and tandem repeat arrays. We show that existing long-read mappers often yield incorrect alignments and variant calls within long, near-identical repeats, as they remain vulnerable to allelic bias. In the presence of a nonreference allele within a repeat, a read sampled from that region could be mapped to an incorrect repeat copy. To address this limitation, we developed a new long-read mapping method, Winnowmap2, by using minimal confidently alignable substrings. Winnowmap2 computes each read mapping through a collection of confident subalignments. This approach is more tolerant of structural variation and more sensitive to paralog-specific variants within repeats. Our experiments highlight that Winnowmap2 successfully addresses the issue of allelic bias, enabling more accurate downstream variant calls in repetitive sequences.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Long-read mapping to repetitive reference sequences using Winnowmap2.

Abstract

Talk to us

Similar Papers

More From: Nature methods

Lead the way for us

Journal: Nature methods	Publication Date: Apr 1, 2022
Citations: 128

Similar Papers

Genome Organisation: Human
David H Kass ... Mark A Batzer
-
David H Kass, et. al.David H Kass ... Mark A Batzer
20 Apr 2021
20 Apr 2021

A novel repetitive DNA sequence in the genus Oryza
Tiyun Wu ... Ray Wu
Theoretical and Applied Genetics | VOL. 84
Tiyun Wu, et. al.Tiyun Wu ... Ray Wu
01 Jun 1992
Theoretical and Applied Genetics | VOL. 84

A multilocus approach for accurate variant calling in low-copy repeats using whole-genome sequencing.
Timofey Prodanov ... Vikas Bansal
Bioinformatics | VOL. 39
Timofey Prodanov, et. al.Timofey Prodanov ... Vikas Bansal
30 Jun 2023
Bioinformatics | VOL. 39

Molecular cloning and characterization of the repetitive DNA sequences that comprise the constitutive heterochromatin of the A and B chromosomes of the Korean field mouse (Apodemus peninsulae, Muridae, Rodentia)
Kazumi Matsubara ... Kazuo Moriwaki
Chromosome Research | VOL. 16
Kazumi Matsubara, et. al.Kazumi Matsubara ... Kazuo Moriwaki
01 Oct 2008
Chromosome Research | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Long-read mapping to repetitive reference sequences using Winnowmap2.

Abstract

Talk to us

Similar Papers

More From: Nature methods