Abstract

SummaryThe computational detection and exclusion of cellular doublets and/or multiplets is a cornerstone for the identification the true biological signals from single-cell RNA sequencing (scRNA-seq) data. Current methods do not sensitively identify both heterotypic and homotypic doublets and/or multiplets. Here, we describe a machine learning approach for doublet/multiplet detection utilizing VDJ-seq and/or CITE-seq data to predict their presence based on transcriptional features associated with identified hybrid droplets. This approach highlights the utility of leveraging multi-omic single-cell information for the generation of high-quality datasets. Our method has high sensitivity and specificity in inflammatory-cell-dominant scRNA-seq samples, thus presenting a powerful approach to ensuring high-quality scRNA-seq data.

Highlights

  • The use of single-cell RNA sequencing techniques has revolutionized the characterization of complex biological systems in health and diseased states; its fundamental success relies on the true representations of high-quality single cells

  • We propose a model-based classification of doublets based on the profiles of identified mixed-cell droplets (Figures 1A and S1). We applied these approaches to publicly available datasets containing RNA-seq, CITE-seq, and VDJ-seq modalities of peripheral blood mononuclear cells (PBMCs) from three healthy individuals

  • A total of 26,080 droplets were retained after filtering (7,024–11,382 per sample), for which the broad immune cell types were annotated through differential gene expression and CITE-seq marker expression (Figures 1B and S1C)

Read more

Summary

Introduction

The use of single-cell RNA sequencing (scRNA-seq) techniques has revolutionized the characterization of complex biological systems in health and diseased states; its fundamental success relies on the true representations of high-quality single cells. Many scRNA-seq techniques face significant limitations, including the capture of doublets (or multiplets) and/or low-quality or dying cells, which potentially confounds biological results. Doublets and multiplets are defined as the aggregation of two or more cells into single droplets during the cell capture step of scRNA-seq, resulting in hybrid transcriptomes (Zheng et al, 2017; Wolock et al, 2019; McGinnis et al, 2019). Most single-cell experiments perform dead cell exclusion, the protocols can be protracted and result in further cell death downstream. These processes can result in false discoveries of rare cell types, intermediate cell states and disease-associated transcriptomic signatures (Stegle et al, 2015; Ilicic et al, 2016). Prospective experimental planning might reduce the frequency of doublet/multiplet occurrence, such as capturing fewer cells during scRNA-seq library

Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.