Abstract

Abstract In single-cell RNA sequencing analysis, several computational methods have been developed to map the cellular state space, but little has been done to map the gene space. A mapping that preserves gene-gene relationships within the dataset is particularly useful for characterizing cellular heterogeneity within cell types, where boundaries between cell subpopulations are often unclear or even arbitrary. Here, we present gene signal pattern analysis, a new paradigm for analyzing single cells. We build a cell-cell graph and design a dictionary of diffusion wavelets, capturing a multiscale view of the cell space. We then transform genes by the dictionary and learn a reduced gene representation. Given the gap in prior research for this problem, we design nine alternative strategies and three benchmarks for evaluating preservation of gene-gene relationships, all of which are outperformed by diffusion wavelet-transformed signals. We also define, calculate, and evaluate localization, a key property of a gene signal on the cellular graph. We demonstrate the utility of gene signal pattern analysis on T cells from a mouse model of peripheral tolerance in skin. The gene space mapping reveals a continuum of gene signals characterized by T cell subtypes and transcriptional programs related to effector function and proliferation. Furthermore, we built a multiscale manifold of 48 melanoma patient samples, demonstrating the ability of our method to characterize differences between responders and non-responders to checkpoint immunotherapy. Together, we show gene signal pattern analysis, through methodology from graph signal processing, spectral graph theory, and machine learning, represents an avenue for future research in scRNA-seq analysis. Gruber Foundation

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call