The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants

Peter J A Cock,Naohisa Goto,Christopher J Fields,Peter M Rice,Michael L Heuer

doi:10.1093/nar/gkp1137

The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants

Peter J A Cock, Naohisa Goto + Show 3 more

Open Access

https://doi.org/10.1093/nar/gkp1137

Copy DOI

Journal: Nucleic Acids Research	Publication Date: Dec 16, 2009
Citations: 1423	License type: CC BY-NC 2.0 UK

Affiliation: University of Illinois Urbana-Champaign, Osaka University, European Bioinformatics Institute, Wellcome Trust

#Base Quality Score #Open Access Publication + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

FASTQ has emerged as a common file format for sharing sequencing read data combining both the sequence and an associated per base quality score, despite lacking any formal definition to date, and existing in at least three incompatible variants. This article defines the FASTQ format, covering the original Sanger standard, the Solexa/Illumina variants and conversion between them, based on publicly available information such as the MAQ documentation and conventions recently agreed by the Open Bioinformatics Foundation projects Biopython, BioPerl, BioRuby, BioJava and EMBOSS. Being an open access publication, it is hoped that this description, with the example files provided as Supplementary Data, will serve in future as a reference for this important file format.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Nucleic Acids Research

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.