Impact of next-generation sequencing error on analysis of barcoded plasmid libraries of known complexity and sequence.

Claire T Deakin,David Humphreys,Paul Young,Claus V Hallwirth,Ian E Alexander,Jeffrey J Deakin,Catherine M Suter,Samantha L Ginn

doi:10.1093/nar/gku607

Abstract

Barcoded vectors are promising tools for investigating clonal diversity and dynamics in hematopoietic gene therapy. Analysis of clones marked with barcoded vectors requires accurate identification of potentially large numbers of individually rare barcodes, when the exact number, sequence identity and abundance are unknown. This is an inherently challenging application, and the feasibility of using contemporary next-generation sequencing technologies is unresolved. To explore this potential application empirically, without prior assumptions, we sequenced barcode libraries of known complexity. Libraries containing 1, 10 and 100 Sanger-sequenced barcodes were sequenced using an Illumina platform, with a 100-barcode library also sequenced using a SOLiD platform. Libraries containing 1 and 10 barcodes were distinguished from false barcodes generated by sequencing error by a several log-fold difference in abundance. In 100-barcode libraries, however, expected and false barcodes overlapped and could not be resolved by bioinformatic filtering and clustering strategies. In independent sequencing runs multiple false-positive barcodes appeared to be represented at higher abundance than known barcodes, despite their confirmed absence from the original library. Such errors, which potentially impact barcoding studies in an application-dependent manner, are consistent with the existence of both stochastic and systematic error, the mechanism of which is yet to be fully resolved.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Nucleic acids research	Publication Date: Jul 10, 2014
Citations: 32	License type: CC BY-NC 3.0

R Discovery Prime

R Discovery Prime

Impact of next-generation sequencing error on analysis of barcoded plasmid libraries of known complexity and sequence.

Abstract

Talk to us

Similar Papers

More From: Nucleic acids research

Lead the way for us

Similar Papers

Assessment of antibody library diversity through next generation sequencing and technical error compensation.
Marco Fantini ... Simonetta Lisi
PLOS ONE | VOL. 12
Marco Fantini, et. al.Marco Fantini ... Simonetta Lisi
15 May 2017
PLOS ONE | VOL. 12

Sequencing accuracy and systematic errors of nanopore direct RNA sequencing
Wang Liu-Wei ... Martin Hölzer
BMC genomics | VOL. 25
Wang Liu-Wei, et. al.Wang Liu-Wei ... Martin Hölzer
28 May 2024
BMC genomics | VOL. 25

Abstract A57: Uncovering instrument errors in next-generation sequencing by CleanDeepSeq2
Eric Davis ... John Easton
Clinical Cancer Research | VOL. 26
Eric Davis, et. al.Eric Davis ... John Easton
01 Jun 2020
Clinical Cancer Research | VOL. 26

Detection of BCR-ABL1 Compound and Polyclonal Mutants in Chronic Myeloid Leukemia Patients Using a Novel Next Generation Sequencing Approach That Minimises PCR and Sequencing Errors
Wendy T Parker ... Susan Branford
Blood | VOL. 124
Wendy T Parker, et. al.Wendy T Parker ... Susan Branford
06 Dec 2014
Blood | VOL. 124

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Impact of next-generation sequencing error on analysis of barcoded plasmid libraries of known complexity and sequence.

Abstract

Talk to us

Similar Papers

More From: Nucleic acids research