Abstract

Most structural variant (SV) detection methods use clusters of discordant read-pair and split-read alignments to identify variants yet do not integrate depth of sequence coverage as an additional means to support or refute putative events. Here, we present "duphold," a new method to efficiently annotate SV calls with sequence depth information that can add (or remove) confidence to SVs that are predicted to affect copy number. Duphold indicates not only the change in depth across the event but also the presence of a rapid change in depth relative to the regions surrounding the break-points. It uses a unique algorithm that allows the run time to be nearly independent of the number of variants. This performance is important for large, jointly called projects with many samples, each of which must be evaluated at thousands of sites. We show that filtering on duphold annotations can greatly improve the specificity of SV calls. Duphold can annotate SV predictions made from both short-read and long-read sequencing datasets. It is available under the MIT license at https://github.com/brentp/duphold.

Highlights

  • Most structural variant detection tools use clusters of discordant read-pair and split-read alignments to identify variants, yet do not integrate depth of sequence coverage as an additional means to support or refute putative events

  • Structural variants (SV) are known to be more difficult to detect with high accuracy than single-nucleotide and insertion-deletion variants

  • The most commonly used structural variant callers[1​ –5] use two types of sequence alignments to discover structural variation: paired-end reads having an unusual orientation or insert size, and split-reads, where the sequence is aligned to different parts of the genome

Read more

Summary

Introduction

Most structural variant detection tools use clusters of discordant read-pair and split-read alignments to identify variants, yet do not integrate depth of sequence coverage as an additional means to support or refute putative events. We present duphold,as a new method to efficiently annotate structural variant calls with sequence depth information that can add (or remove) confidence to SV predicted to affect copy number. It indicates the change in depth across the event, and the presence of a rapid change in depth relative to the regions surrounding the breakpoints. It uses a unique algorithm that allows the run time to be nearly independent of the number of variants. This performance is important for large, jointly-called projects with many samples, each of which must be evaluated at thousands of sites. It is available under the MIT license at: https://github.com/brentp/duphold

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call