Modelling, Characterization of Data-Dependent and Process-Dependent Errors in DNA Data Storage.

Yixin Wang,Yong L Guan,Md Noor-A-Rahim,Erry Gunawan,Chueh L Poh

doi:10.1109/tcbb.2022.3233914

Abstract

Using DNA as the medium to store information has recently been recognized as a promising solution for long-term data storage. While several system prototypes have been demonstrated, the error characteristics in DNA data storage are discussed with limited content. Due to the data and process variations from experiment to experiment, the error variation and its effect on data recovery remain to be uncovered. To close the gap, we systematically investigate the storage channel, i.e., error characteristics in the storage process. In this work, we first propose a new concept named sequence corruption to unify the error characteristics into the sequence level, easing the channel analysis. Then we derived the formulations of the data imperfection at the decoder including both sequence loss and sequence corruption, revealing the decoding demand and monitoring the data recovery. Furthermore, we extensively explored several data-dependent unevenness observed in the base error patterns and studied a few potential factors and their impacts on the data imperfection at the decoder both theoretically and experimentally. The results presented here introduce a more comprehensive channel model and offer a new angle towards the data recovery issue in DNA data storage by further elucidating the error characteristics of the storage process.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Modelling, Characterization of Data-Dependent and Process-Dependent Errors in DNA Data Storage.

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Computational Biology and Bioinformatics

Lead the way for us

Journal: IEEE/ACM Transactions on Computational Biology and Bioinformatics	Publication Date: May 1, 2023
Citations: 5

Similar Papers

DNA Data Storage: The Fusion of Digital and Biological Information
Xiao Li
Theoretical and Natural Science | VOL. 4
Xiao LiXiao Li
28 Apr 2023
Theoretical and Natural Science | VOL. 4

Optimizing fountain codes for DNA data storage
Peter Michael Schwarz ... Bernd Freisleben
Computational and Structural Biotechnology Journal | VOL. -
Peter Michael Schwarz, et. al.Peter Michael Schwarz ... Bernd Freisleben
01 Oct 2024
Computational and Structural Biotechnology Journal | VOL. -

Recent progress in DNA data storage based on high-throughput DNA synthesis.
Seokwoo Jo ... Honggu Chun
Biomedical engineering letters | VOL. 14
Seokwoo Jo, et. al.Seokwoo Jo ... Honggu Chun
03 May 2024
Biomedical engineering letters | VOL. 14

Decoding DNA data storage for investment
Philip M Stanley ... Kevin C.K Lee
Biotechnology Advances | VOL. 45
Philip M Stanley, et. al.Philip M Stanley ... Kevin C.K Lee
28 Sep 2020
Biotechnology Advances | VOL. 45

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Modelling, Characterization of Data-Dependent and Process-Dependent Errors in DNA Data Storage.

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Computational Biology and Bioinformatics