Abstract Motivation Quality control of sequencing data is the first step in many sequencing workflows. Short- and long-read sequencing technologies have many commonalities with regards to quality control. Several quality control programs exist, however none possess a feature set that is adequate for both technologies. Quality Control programs aimed at Oxford Nanopore Technologies sequencing lack vital features such as adapter searching, overrepresented sequence analysis and duplication analysis. Results Sequali was developed to provide sequencing quality control for both short- and long-read sequencing technologies. It features adapter search, overrepresented sequence analysis and duplication analysis and supports FASTQ and uBAM inputs. It is significantly faster than comparable sequencing quality control programs for both short- and long-read sequencing technologies. Availability and Implementation Sequali is an open source Python application using C extensions and is freely available under the AGPL-3.0 license at https://github.com/rhpvorderman/sequali. The source code for each release is archived at zenodo: https://zenodo.org/doi/10.5281/zenodo.10822485.
Read full abstract