Clustering-Correcting Codes

Tal Shinkar, Andreas Lenz, Antonia Wachter-Zeh, Eitan Yaakobi

doi:10.1109/tit.2021.3127174

Abstract

A new family of codes, called clustering-correcting codes, is presented in this paper. This family of codes is motivated by the special structure of the data that is stored in DNA-based storage systems. The data stored in these systems has the form of unordered sequences, also called strands. Every strand is synthesized thousands to millions of times, and some of these copies are read back during sequencing. Because the strands are unordered, an important task in the decoding process is to place them in their correct order. This is usually accomplished by allocating part of each strand to an index. However, in the presence of errors in the index field, important information about the order of the strands may be lost. Clustering-correcting codes ensure that if the distance between the index fields of two strands is small, then the distance between their data fields is large. It is shown how this property makes it possible to place the strands together in their correct clusters even in the presence of errors. We present lower and upper bounds on the size of clustering-correcting codes and an explicit construction of these codes that uses only a single symbol of redundancy. The results are first presented for the Hamming metric and are then extended to the edit distance.
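The defining property stated in the abstract (close index fields imply distant data fields) can be illustrated for the Hamming metric with a minimal sketch. The names `index_radius` and `min_data_distance`, the `(index, data)` tuple layout, and the toy strands are assumptions made for illustration only; they are not the paper's notation, parameters, or construction.

```python
from itertools import combinations


def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("Hamming distance needs equal-length strings")
    return sum(x != y for x, y in zip(a, b))


def violating_pairs(strands, index_radius, min_data_distance):
    """Return pairs (i, j) whose index fields are close but whose data fields are too similar.

    strands: list of (index_field, data_field) string tuples (hypothetical layout).
    index_radius: index fields at Hamming distance <= this value count as "close" (assumed name).
    min_data_distance: required Hamming distance between data fields of close pairs (assumed name).
    """
    bad = []
    for (i, (idx_a, data_a)), (j, (idx_b, data_b)) in combinations(enumerate(strands), 2):
        close_indices = hamming(idx_a, idx_b) <= index_radius
        if close_indices and hamming(data_a, data_b) < min_data_distance:
            bad.append((i, j))
    return bad


def has_clustering_property(strands, index_radius, min_data_distance):
    """True if no pair of strands violates the property described in the abstract."""
    return not violating_pairs(strands, index_radius, min_data_distance)


# Toy example: strands 0 and 1 have neighbouring indices and nearly identical data.
weak = [("00", "ACGT"), ("01", "ACGA"), ("11", "TTTT")]
# Same indices, but the data field of strand 1 now differs from strand 0 in two positions.
fixed = [("00", "ACGT"), ("01", "ACTA"), ("11", "TTTT")]

print(violating_pairs(weak, index_radius=1, min_data_distance=2))
print(has_clustering_property(fixed, index_radius=1, min_data_distance=2))
```

This check only verifies the property on a given set of strands. It does not construct a code, and it says nothing about the bounds or the single-symbol-redundancy construction mentioned in the abstract.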

Journal: IEEE Transactions on Information Theory
Publication Date: Mar 1, 2022
Citations: 5

Similar Papers

Clustering-Correcting Codes
Tal Shinkar ... Andreas Lenz
01 Jul 2019

On optimal family of codes for archival DNA storage
Dixita Limbachiya ... Madhav Khakhar
01 Sep 2015

A new family of 2-D wavelength-time codes for optical CDMA with differential detection
R.M.H Yim ... L.R Chen
IEEE Photonics Technology Letters | VOL. 15
01 Jan 2003

Asymmetric Lee Distance Codes for DNA-Based Storage
Ryan Gabrys ... Han Mao Kiah
IEEE Transactions on Information Theory | VOL. 63
01 Aug 2017
