Discrete latent embedding of single-cell chromatin accessibility sequencing data for uncovering cell heterogeneity.

Xuejian Cui,Rui Jiang,Zhen Li,Shengquan Chen,Xiaoyang Chen,Zijing Gao

doi:10.1038/s43588-024-00625-4

Abstract

Single-cell epigenomic data has been growing continuously at an unprecedented pace, but their characteristics such as high dimensionality and sparsity pose substantial challenges to downstream analysis. Although deep learning models-especially variational autoencoders-have been widely used to capture low-dimensional feature embeddings, the prevalent Gaussian assumption somewhat disagrees with real data, and these models tend to struggle to incorporate reference information from abundant cell atlases. Here we propose CASTLE, a deep generative model based on the vector-quantized variational autoencoder framework to extract discrete latent embeddings that interpretably characterize single-cell chromatin accessibility sequencing data. We validate the performance and robustness of CASTLE for accurate cell-type identification and reasonable visualization compared with state-of-the-art methods. We demonstrate the advantages of CASTLE for effective incorporation of existing massive reference datasets in a weakly supervised or supervised manner. We further demonstrate CASTLE's capacity for intuitively distilling cell-type-specific feature spectra that unveil cell heterogeneity and biological implications quantitatively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Discrete latent embedding of single-cell chromatin accessibility sequencing data for uncovering cell heterogeneity.

Abstract

Talk to us

Similar Papers

More From: Nature computational science

Lead the way for us

Journal: Nature computational science	Publication Date: May 10, 2024
Citations: 3

Similar Papers

Conifer: clonal tree inference for tumor heterogeneity with single-cell and bulk sequencing data
Leila Baghaarabani ... Bahram Goliaei
BMC Bioinformatics | VOL. 22
Leila Baghaarabani, et. al.Leila Baghaarabani ... Bahram Goliaei
30 Aug 2021
BMC Bioinformatics | VOL. 22

Identifying Genetic Signatures from Single-Cell RNA Sequencing Data by Matrix Imputation and Reduced Set Gene Clustering
Soumita Seth ... Tapas Bhadra
Mathematics | VOL. 11
Soumita Seth, et. al.Soumita Seth ... Tapas Bhadra
17 Oct 2023
Mathematics | VOL. 11

Abstract LB019: Trisicell: Scalable Tumor Phylogeny Reconstruction and Validation Reveals Developmental Origin and Therapeutic Impact of Intratumoral Heterogeneity
Farid Rashidi Mehrabadi ... Huaitian Liu
Cancer Research | VOL. 81
Farid Rashidi Mehrabadi, et. al.Farid Rashidi Mehrabadi ... Huaitian Liu
01 Jul 2021
Cancer Research | VOL. 81

SingleScan: a comprehensive resource for single-cell sequencing data processing and mining
Kun Wang ... Haoyang Cai
BMC Bioinformatics | VOL. 24
Kun Wang, et. al.Kun Wang ... Haoyang Cai
07 Dec 2023
BMC Bioinformatics | VOL. 24

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Discrete latent embedding of single-cell chromatin accessibility sequencing data for uncovering cell heterogeneity.

Abstract

Talk to us

Similar Papers

More From: Nature computational science