Comparison of high-throughput sequencing data compression tools.

Ibrahim Numanagić,S Cenk Sahinalp,Marco Mattavelli,James K Bonfield,Faraz Hach,Jörn Ostermann,Claudio Alberti,Jan Voges

doi:10.1038/nmeth.4037

Comparison of high-throughput sequencing data compression tools.

Ibrahim Numanagić, S Cenk Sahinalp + Show 6 more

https://doi.org/10.1038/nmeth.4037

Copy DOI

Journal: Nature methods	Publication Date: Oct 24, 2016
Citations: 96

Affiliation: École Polytechnique Fédérale de Lausanne, Wellcome Sanger Institute, Simon Fraser University, Institut für Informationsverarbeitung, Leibniz University Hannover

#High-throughput Sequencing Data #Raw Sequencing Reads + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

High-throughput sequencing (HTS) data are commonly stored as raw sequencing reads in FASTQ format or as reads mapped to a reference, in SAM format, both with large memory footprints. Worldwide growth of HTS data has prompted the development of compression methods that aim to significantly reduce HTS data size. Here we report on a benchmarking study of available compression methods on a comprehensive set of HTS data using an automated framework.

Full Text