Abstract

The exponential growth of high-throughput DNA sequence data has brought great challenges in data processing, archive and transmission. How to improve compression techniques for large datasets of sequence read archive has become a critical problem in store and analyzes biological data. The paper compared the existing data compression methods on five high-throughput sequence datasets, and proposed a novel method to compress high-throughput sequence read archive data. The experiment results show that the proposed compression method could get good compression ration and implement higher processing speed.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call