Abstract

BackgroundRare single nucleotide variants play an important role in genetic diversity and heterogeneity of specific human disease. For example, an individual clinical sample can harbor rare mutations at minor frequencies. Genetic diversity within an individual clinical sample is oftentimes reflected in rare mutations. Therefore, detecting rare variants prior to treatment may prove to be a useful predictor for therapeutic response. Current rare variant detection algorithms using next generation DNA sequencing are limited by inherent sequencing error rate and platform availability.FindingsHere we describe an optimized implementation of a rare variant detection algorithm called RVD for use in targeted gene resequencing. RVD is available both as a command-line program and for use in MATLAB and estimates context-specific error using a beta-binomial model to call variants with minor allele frequency (MAF) as low as 0.1%. We show that RVD accepts standard BAM formatted sequence files. We tested RVD analysis on multiple Illumina sequencing platforms, among the most widely used DNA sequencing platforms.ConclusionsRVD meets a growing need for highly sensitive and specific tools for variant detection. To demonstrate the usefulness of RVD, we carried out a thorough analysis of the software’s performance on synthetic and clinical virus samples sequenced on both an Illumina GAIIx and a MiSeq. We expect RVD can improve understanding the genetics and treatment of common viral diseases including influenza. RVD is available at the following URL:http://dna-discovery.stanford.edu/software/rvd/.

Highlights

  • Rare single nucleotide variants play an important role in genetic diversity and heterogeneity of specific human disease

  • RVD is available at the following URL:http://dna-discovery.stanford.edu/software/rvd/

  • Setting the resolution threshold By testing a range of resolution thresholds, we find that an optimal threshold to jointly maximize sensitivity and specificity is 1⁄2 of the desired minor allele frequency (MAF) detection level

Read more

Summary

Introduction

Rare single nucleotide variants play an important role in genetic diversity and heterogeneity of specific human disease. Genetic diversity within an individual clinical sample is oftentimes reflected in rare mutations. Current rare variant detection algorithms using generation DNA sequencing are limited by inherent sequencing error rate and platform availability. Generation sequencing (NGS) is currently used in a research setting to discover novel mutations in cancer, viral, and environmental samples. As the cost of sequencing decreases, this technology is increasingly used to assess genetic diversity both for basic research as well as translational applications in human diseases. The detection level of current algorithms is limited by the inherent error rate in generation sequencing technologies, generally quoted as 1-3% [1,2] or 0.25% in [3]. CRISP [4] reports to detect variants in large pooled data sets at 2% MAF on an Illumina GA platform but with only 86.3% sensitivity and 97% specificity. By significantly changing the sample preparation technique, Schmitt et al report a resolution of 1x10-9 [8]

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call