Demuxmix: Demultiplexing oligonucleotide-barcoded single-cell RNA sequencing data with regression mixture models.

Hans-Ulrich Klein

doi:10.1093/bioinformatics/btad481

Hans-Ulrich Klein

Open Access

https://doi.org/10.1093/bioinformatics/btad481

Copy DOI

Journal: Bioinformatics	Publication Date: Aug 1, 2023
Citations: 14	License type: CC BY 4.0

Affiliation: Columbia University Irving Medical Center

Abstract

Droplet-based single-cell RNA sequencing (scRNA-seq) is widely used in biomedical research for interrogating the transcriptomes of single cells on a large scale. Pooling and processing cells from different samples together can reduce costs and batch effects. To pool cells, they are often first labeled with hashtag oligonucleotides (HTOs). These HTOs are sequenced alongside the cells' RNA in the droplets and subsequently used to computationally assign each droplet to its sample of origin, a process referred to as demultiplexing. Accurate demultiplexing is crucial but can be challenging due to background HTOs, low-quality cells/cell debris, and multiplets. A new demultiplexing method based on negative binomial regression mixture models is introduced. The method, called demuxmix, implements two significant improvements. First, demuxmix's probabilistic classification framework provides error probabilities for droplet assignments that can be used to discard uncertain droplets and inform about the quality of the HTO data and the success of the demultiplexing process. Second, demuxmix utilizes the positive association between detected genes in the RNA library and HTO counts to explain parts of the variance in the HTO data resulting in improved droplet assignments. The improved performance of demuxmix compared to existing demultiplexing methods is assessed using real and simulated data. Finally, the feasibility of accurately demultiplexing experimental designs where non-labeled cells are pooled with labeled cells is demonstrated. R/Bioconductor package demuxmix (https://doi.org/doi:10.18129/B9.bioc.demuxmix). Supplementary data are available at Bioinformatics online.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Demuxmix: Demultiplexing oligonucleotide-barcoded single-cell RNA sequencing data with regression mixture models.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics

Lead the way for us

Similar Papers

Decision letter: Applying causal discovery to single-cell analyses using CausalCell
Babak Momeni ... Anna Akhmanova
-
Babak Momeni, et. al.Babak Momeni ... Anna Akhmanova
14 Aug 2022
14 Aug 2022

Author response: Applying causal discovery to single-cell analyses using CausalCell
Jielong Huang ... Hao Zhu
-
Jielong Huang, et. al.Jielong Huang ... Hao Zhu
23 Aug 2022
23 Aug 2022

SoupX removes ambient RNA contamination from droplet-based single-cell RNA sequencing data.
Matthew D Young ... Sam Behjati
GigaScience | VOL. 9
Matthew D Young, et. al.Matthew D Young ... Sam Behjati
26 Dec 2020
GigaScience | VOL. 9

Scalable single-cell RNA sequencing from full transcripts with Smart-seq3xpress
Michael Hagemann-Jensen ... Rickard Sandberg
Nature Biotechnology | VOL. 40
Michael Hagemann-Jensen, et. al.Michael Hagemann-Jensen ... Rickard Sandberg
30 May 2022
Nature Biotechnology | VOL. 40

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Demuxmix: Demultiplexing oligonucleotide-barcoded single-cell RNA sequencing data with regression mixture models.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics