Abstract

Next-generation sequencing technology has enabled the detection of rare genetic or somatic mutations and contributed to our understanding of disease progression and evolution. However, many next-generation sequencing technologies first rely on DNA amplification, via the Polymerase Chain Reaction (PCR), as part of sample preparation workflows. Mistakes made during PCR appear in sequencing data and contribute to false mutations that can ultimately confound genetic analysis. In this report, a single-molecule sequencing assay was used to comprehensively catalog the different types of errors introduced during PCR, including polymerase misincorporation, structure-induced template-switching, PCR-mediated recombination and DNA damage. In addition to well-characterized polymerase base substitution errors, other sources of error were found to be equally prevalent. PCR-mediated recombination by Taq polymerase was observed at the single-molecule level, and surprisingly found to occur as frequently as polymerase base substitution errors, suggesting it may be an underappreciated source of error for multiplex amplification reactions. Inverted repeat structural elements in lacZ caused polymerase template-switching between the top and bottom strands during replication and the frequency of these events were measured for different polymerases. For very accurate polymerases, DNA damage introduced during temperature cycling, and not polymerase base substitution errors, appeared to be the major contributor toward mutations occurring in amplification products. In total, we analyzed PCR products at the single-molecule level and present here a more complete picture of the types of mistakes that occur during DNA amplification.

Highlights

  • Genetic variation underlies many fundamental aspects of biology

  • To validate the use of single-molecule sequencing for measuring replication fidelity, the fidelity of Taq polymerase was measured using both PacBio Single Molecule Real-Time (SMRT) and traditional Sanger sequencing

  • PacBio sequencing gave a similar distribution for base substitutions (97.3%), deletions (2.6%) and insertions (0.1%), the fraction of insertions and deletions was slightly higher than those observed by Sanger sequencing

Read more

Summary

Introduction

Genetic variation underlies many fundamental aspects of biology. Mutations drive speciation or cause disease, and their detection has been critical to our understanding of evolution and translational medicine. Spontaneous mutations accumulate in somatic cells, and much progress has been made in understanding their contribution to cancer and aging [1]. Understanding disease progression and optimizing therapy, often requires the detection of rare mutations in heterogeneous samples. PLOS ONE | DOI:10.1371/journal.pone.0169774 January 6, 2017

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.