Structure-Aware Principal Component Analysis for Single-Cell RNA-seq Data.

Snehalika Lall,Debajyoti Sinha,Debarka Sengupta,Sanghamitra Bandyopadhyay

doi:10.1089/cmb.2018.0027

Abstract

With the emergence of droplet-based technologies, it has now become possible to profile transcriptomes of several thousands of cells in a day. Although such a large single-cell cohort may favor the discovery of cellular heterogeneity, it also brings new challenges in the prediction of minority cell types. Identification of any minority cell type holds a special significance in knowledge discovery. In the analysis of single-cell expression data, the use of principal component analysis (PCA) is surprisingly frequent for dimension reduction. The principal directions obtained from PCA are usually dominated by the major cell types in the concerned tissue. Thus, it is very likely that using a traditional PCA may endanger the discovery of minority populations. To this end, we propose locality-sensitive PCA (LSPCA), a scalable variant of PCA equipped with structure-aware data sampling at its core. Structure-aware sampling provides PCA with a neutral spread of the data, thereby reducing the bias in its principal directions arising from the redundant samples in a data set. We benchmarked the performance of the proposed method on ten publicly available single-cell expression data sets including one very large annotated data set. Results have been compared with traditional PCA and PCA with random sampling. Clustering results on the annotated data sets also show that LSPCA can detect the minority populations with a higher accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Structure-Aware Principal Component Analysis for Single-Cell RNA-seq Data.

Abstract

Talk to us

Similar Papers

More From: Journal of Computational Biology

Lead the way for us

Journal: Journal of Computational Biology	Publication Date: Aug 22, 2018
Citations: 38

Similar Papers

Supervised Discriminative Sparse PCA for Com-Characteristic Gene Selection and Tumor Classification on Multiview Biological Data.
Chun-Mei Feng ... Yong Xu
IEEE Transactions on Neural Networks and Learning Systems | VOL. 30
Chun-Mei Feng, et. al.Chun-Mei Feng ... Yong Xu
22 Feb 2019
IEEE Transactions on Neural Networks and Learning Systems | VOL. 30

Applying Weighted PCA on Multiclass Classification for Intrusion Detection
Mohsen Moshki ... Peyman Kabiri
-
Mohsen Moshki, et. al.Mohsen Moshki ... Peyman Kabiri
01 Jan 2012
01 Jan 2012

Joint L2,p-norm and random walk graph constrained PCA for single-cell RNA-seq data
Tai-Ge Wang ... Juan Wang
Computer Methods in Biomechanics and Biomedical Engineering | VOL. ahead-of-print
Tai-Ge Wang, et. al.Tai-Ge Wang ... Juan Wang
06 Mar 2023
Computer Methods in Biomechanics and Biomedical Engineering | VOL. ahead-of-print

DGCyTOF: Deep learning with graphic cluster visualization to predict cell types of single cell mass cytometry data.
Lijun Cheng ... Lang Li
PLoS computational biology | VOL. 18
Lijun Cheng, et. al.Lijun Cheng ... Lang Li
11 Apr 2022
PLoS computational biology | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Structure-Aware Principal Component Analysis for Single-Cell RNA-seq Data.

Abstract

Talk to us

Similar Papers

More From: Journal of Computational Biology