Low-complexity and highly robust barcodes for error-rich single molecular sequencing.

Weigang Chen,Dalu Zhang,Lifu Song,Panpan Wang,Mingyong Han,Mingzhe Han,Lixia Wang

doi:10.1007/s13205-020-02607-5

Abstract

DNA barcodes are frequently corrupted due to insertion, deletion, and substitution errors during DNA synthesis, amplification and sequencing, resulting in index hopping. In this paper, we propose a new DNA barcode construction scheme that combines a cyclic block code with a predetermined pseudo-random sequence bit by bit to form bit pairs, and then converts the bit pairs to bases, i.e., the DNA barcodes. Then, we present a barcode identification scheme for noisy sequencing reads, which uses a combination of cyclic shifting and traditional dynamic programming to mark the insertion and deletion positions, and then performs erasure-and-error-correction decoding on the corrupted codewords. Furthermore, we verify the identification error rate of barcodes for multiple errors and evaluate the reliability of the barcodes in DNA context. This method can be easily generalized for constructing long barcodes, which may be used in scenarios with serious errors. Simulation results show that the bit error rate after identifying insertions/deletions is greatly reduced using the combination of cyclic shift and dynamic programming compared to using dynamic programming only. It indicates that the proposed method can effectively improve the accuracy for estimating insertion/deletion errors. And the overall identification error rate of the proposed method is lower than when the probability of each base mutation is less than 0.1, which is the typical scenario in third-generation sequencing.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Low-complexity and highly robust barcodes for error-rich single molecular sequencing.

Abstract

Talk to us

Similar Papers

More From: 3 Biotech

Lead the way for us

Journal: 3 Biotech	Publication Date: Jan 16, 2021
Citations: 1

Similar Papers

An enhanced minimum classification error learning framework for balancing insertion, deletion and substitution errors
Yuan Fu Liao ... Sen Chia Chang
-
Yuan Fu Liao, et. al. Yuan Fu Liao ... Sen Chia Chang
01 Jan 2007
01 Jan 2007

Insertion and deletion correcting DNA barcodes based on watermarks.
David Kracht ... Steffen Schober
BMC Bioinformatics | VOL. 16
David Kracht, et. al.David Kracht ... Steffen Schober
18 Feb 2015
BMC Bioinformatics | VOL. 16

Indel-correcting DNA barcodes for high-throughput sequencing
John A Hawkins ... William H Press
Proceedings of the National Academy of Sciences | VOL. 115
John A Hawkins, et. al.John A Hawkins ... William H Press
20 Jun 2018
Proceedings of the National Academy of Sciences | VOL. 115

Deformed fuzzy automata for correcting imperfect strings of fuzzy symbols
J Ramon Garitagoitia ... J Javier Astrain
IEEE Transactions on Fuzzy Systems | VOL. 11
J Ramon Garitagoitia, et. al.J Ramon Garitagoitia ... J Javier Astrain
01 Jun 2003
IEEE Transactions on Fuzzy Systems | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Low-complexity and highly robust barcodes for error-rich single molecular sequencing.

Abstract

Talk to us

Similar Papers

More From: 3 Biotech