Abstract

Deoxyribonucleic acid (DNA) has been proposed in recent years as a very promising medium for data storage. Although numerous studies have advocated for DNA data storage, its practical application remains limited, and a user-oriented platform is lacking. Here, we developed a DNA data storage platform, named Storage-D, which allows users to convert their data into DNA sequences of any length, and vice versa, by selecting the algorithm, error-correction, random-access, and codec pin strategies of their choice. It incorporates a newly designed "Wukong" algorithm, which provides over 20 trillion codec pins for data privacy. The algorithm can also hold GC content to a selected standard and limit homopolymer run length to a defined level while maintaining a high coding potential of ~1.98 bits/nt, allowing it to outperform previous algorithms. By connecting Storage-D to a commercial DNA synthesis and sequencing platform, we successfully stored the "Diagnosis and treatment protocol for COVID-19 patients" in 200 nt oligo pools in vitro and in 500 bp genes in vivo, which replicated in both normal and extreme bacteria. Together, this platform allows for practical and personalized DNA data storage, potentially with a wide range of applications.
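
To make the sequence constraints named above concrete, the following is a minimal Python sketch of how GC content, homopolymer run length, and coding density (bits/nt) might be measured for an encoded sequence. It is not the Wukong algorithm and not part of Storage-D: the fixed 2 bits/nt mapping and all function names (`encode_naive`, `gc_content`, `max_homopolymer_run`, `bits_per_nt`) are assumptions introduced purely for illustration.

```python
from itertools import groupby

# Hypothetical fixed mapping of 2 bits per nucleotide, for illustration only.
# This is NOT the Wukong algorithm described in the abstract.
BITS_TO_BASE = {"00": "A", "01": "C", "10": "G", "11": "T"}


def encode_naive(data: bytes) -> str:
    """Map every pair of bits to one nucleotide (unconstrained encoding)."""
    bits = "".join(f"{byte:08b}" for byte in data)
    return "".join(BITS_TO_BASE[bits[i:i + 2]] for i in range(0, len(bits), 2))


def gc_content(seq: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def max_homopolymer_run(seq: str) -> int:
    """Length of the longest run of identical consecutive bases."""
    return max((len(list(group)) for _, group in groupby(seq)), default=0)


def bits_per_nt(payload_bits: int, total_nt: int) -> float:
    """Coding density: payload bits stored per nucleotide used."""
    return payload_bits / total_nt


if __name__ == "__main__":
    seq = encode_naive(b"DNA")
    print(seq, gc_content(seq), max_homopolymer_run(seq))
    print(bits_per_nt(8 * len(b"DNA"), len(seq)))
```

An unconstrained mapping like this reaches the theoretical maximum of 2 bits/nt but offers no control over GC content or homopolymer runs. A constrained scheme gives up a little density to enforce those limits, which is consistent with the ~1.98 bits/nt reported above.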
