Comparison of transformations for single-cell RNA-seq data

Constantin Ahlmann-Eltze,Wolfgang Huber

doi:10.1038/s41592-023-01814-1

Constantin Ahlmann-Eltze, Wolfgang Huber

Open Access

https://doi.org/10.1038/s41592-023-01814-1

Copy DOI

Abstract

The count table, a numeric matrix of genes × cells, is the basic input data structure in the analysis of single-cell RNA-sequencing data. A common preprocessing step is to adjust the counts for variable sampling efficiency and to transform them so that the variance is similar across the dynamic range. These steps are intended to make subsequent application of generic statistical methods more palatable. Here, we describe four transformation approaches based on the delta method, model residuals, inferred latent expression state and factor analysis. We compare their strengths and weaknesses and find that the latter three have appealing theoretical properties; however, in benchmarks using simulated and real-world data, it turns out that a rather simple approach, namely, the logarithm with a pseudo-count followed by principal-component analysis, performs as well or better than the more sophisticated alternatives. This result highlights limitations of current theoretical analysis as assessed by bottom-line performance benchmarks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Nature methods	Publication Date: Apr 10, 2023
Citations: 53	License type: open-access

R Discovery Prime

R Discovery Prime

Comparison of transformations for single-cell RNA-seq data

Abstract

Talk to us

Similar Papers

More From: Nature methods

Lead the way for us

Similar Papers

Guildford Cathedral
J B G Smith
BMJ | VOL. 2
J B G SmithJ B G Smith
05 Jul 1958
BMJ | VOL. 2

Methods and challenges in the analysis of single-cell RNA-sequencing data
Pablo G Camara
Current Opinion in Systems Biology | VOL. 7
Pablo G CamaraPablo G Camara
27 Dec 2017
Current Opinion in Systems Biology | VOL. 7

Protocol for executing and benchmarking eight computational doublet-detection methods in single-cell RNA sequencing data analysis
Nan Miles Xi ... Jingyi Jessica Li
STAR Protocols | VOL. 2
Nan Miles Xi, et. al.Nan Miles Xi ... Jingyi Jessica Li
28 Jul 2021
STAR Protocols | VOL. 2

Allelic correlation is a marker of trade-offs between barriers to transmission of expression variability and signal responsiveness in genetic networks.
Ryan H Boe ... Vinay Ayyappan
Cell Systems | VOL. 13
Ryan H Boe, et. al.Ryan H Boe ... Vinay Ayyappan
01 Dec 2022
Cell Systems | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comparison of transformations for single-cell RNA-seq data

Abstract

Talk to us

Similar Papers

More From: Nature methods