Flexiplex: a versatile demultiplexer and search tool for omics data.

Oliver Cheng,Oliver Cheng,Changqing Wang,Nadia M Davidson,Changqing Wang,Shuyi Wu,Nadia M Davidson,Shuyi Wu,Matthew E Ritchie,Jonathan Göke,Matthew E Ritchie,Noorul Amin,Min Hao Ling,Jonathan Göke,Noorul Amin

doi:10.1093/bioinformatics/btae102

Oliver Cheng, Oliver Cheng + Show 13 more

https://doi.org/10.1093/bioinformatics/btae102

Copy DOI

Abstract

The process of analyzing high throughput sequencing data often requires the identification and extraction of specific target sequences. This could include tasks, such as identifying cellular barcodes and UMIs in single-cell data, and specific genetic variants for genotyping. However, existing tools, which perform these functions are often task-specific, such as only demultiplexing barcodes for a dedicated type of experiment, or are not tolerant to noise in the sequencing data. To overcome these limitations, we developed Flexiplex, a versatile and fast sequence searching and demultiplexing tool for omics data, which is based on the Levenshtein distance and thus allows imperfect matches. We demonstrate Flexiplex's application on three use cases, identifying cell-line-specific sequences in Illumina short-read single-cell data, and discovering and demultiplexing cellular barcodes from noisy long-read single-cell RNA-seq data. We show that Flexiplex achieves an excellent balance of accuracy and computational efficiency compared to leading task-specific tools. Flexiplex is available at https://davidsongroup.github.io/flexiplex/.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Bioinformatics (Oxford, England)	Publication Date: Feb 21, 2024
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Flexiplex: a versatile demultiplexer and search tool for omics data.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics (Oxford, England)

Lead the way for us

Similar Papers

A UNIFIED STATISTICAL FRAMEWORK FOR SINGLE CELL AND BULK RNA SEQUENCING DATA.
Lingxue Zhu ... Bernie Devlin
The Annals of Applied Statistics | VOL. 12
Lingxue Zhu, et. al.Lingxue Zhu ... Bernie Devlin
30 Mar 2017
The Annals of Applied Statistics | VOL. 12

Accurate and efficient cell lineage tree inference from noisy single cell data: the maximum likelihood perfect phylogeny approach
Yufeng Wu
Bioinformatics | VOL. 36
Yufeng WuYufeng Wu
28 Aug 2019
Bioinformatics | VOL. 36

Evaluating methods of inferring gene regulatory networks highlights their lack of performance for single cell gene expression data
Shuonan Chen ... Jessica C Mar
BMC Bioinformatics | VOL. 19
Shuonan Chen, et. al.Shuonan Chen ... Jessica C Mar
19 Jun 2018
BMC Bioinformatics | VOL. 19

Gpps: an ILP-based approach for inferring cancer progression with mutation losses from single cell data
Simone Ciccolella ... Murray D Patterson
BMC Bioinformatics | VOL. 21
Simone Ciccolella, et. al.Simone Ciccolella ... Murray D Patterson
01 Dec 2020
BMC Bioinformatics | VOL. 21

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Flexiplex: a versatile demultiplexer and search tool for omics data.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics (Oxford, England)