Exodus: sequencing-based pipeline for quantification of pooled variants.

Ilya Vainberg-Slutskin,Naomi B Zak,Iddo Weiner,Tal Cohen,Merav Bassan,Sharon Kredo-Russo,Jenia Gold,Noga Kowalsman,Inbar Gahali-Sass,Edith Kario,Yael Silberberg,Can Alkan

doi:10.1093/bioinformatics/btac319

Ilya Vainberg-Slutskin, Naomi B Zak + Show 10 more

Open Access

https://doi.org/10.1093/bioinformatics/btac319

Copy DOI

Journal: Bioinformatics (Oxford, England)	Publication Date: May 12, 2022
Citations: 2	License type: CC BY 4.0

Affiliation: Mapi Pharma (Israel)

Abstract

SummaryNext-Generation Sequencing is widely used as a tool for identifying and quantifying microorganisms pooled together in either natural or designed samples. However, a prominent obstacle is achieving correct quantification when the pooled microbes are genetically related. In such cases, the outcome mostly depends on the method used for assigning reads to the individual targets. To address this challenge, we have developed Exodus—a reference-based Python algorithm for quantification of genomes, including those that are highly similar, when they are sequenced together in a single mix. To test Exodus’ performance, we generated both empirical and in silico next-generation sequencing data of mixed genomes. When applying Exodus to these data, we observed median error rates varying between 0% and 0.21% as a function of the complexity of the mix. Importantly, no false negatives were recorded, demonstrating that Exodus’ likelihood of missing an existing genome is very low, even if the genome’s relative abundance is low and similar genomes are present in the same mix. Taken together, these data position Exodus as a reliable tool for identifying and quantifying genomes in mixed samples. Exodus is open source and free to use at: https://github.com/ilyavs/exodus.Availability and implementationExodus is implemented in Python within a Snakemake framework. It is available on GitHub alongside a docker containing the required dependencies: https://github.com/ilyavs/exodus. The data underlying this article will be shared on reasonable request to the corresponding author.Supplementary information Supplementary data are available at Bioinformatics online.

Full Text