Evaluating the performance of dropout imputation and clustering methods for single-cell RNA sequencing data

Junlin Xu,Lingyu Cui,Jujuan Zhuang,Yajie Meng,Pingping Bing,Binsheng He,Geng Tian,Choi Kwok Pui,Taoyang Wu,Bing Wang,Jialiang Yang

doi:10.1016/j.compbiomed.2022.105697

Abstract

Recent advances in single-cell RNA sequencing (scRNA-seq) provide exciting opportunities for transcriptome analysis at single-cell resolution. Clustering individual cells is a key step to reveal cell subtypes and infer cell lineage in scRNA-seq analysis. Although many dedicated algorithms have been proposed, clustering quality remains a computational challenge for scRNA-seq data, which is exacerbated by inflated zero counts due to various technical noise. To address this challenge, we assess the combinations of nine popular dropout imputation methods and eight clustering methods on a collection of 10 well-annotated scRNA-seq datasets with different sample sizes. Our results show that (i) imputation algorithms do typically improve the performance of clustering methods, and the quality of data visualization using t-Distributed Stochastic Neighbor Embedding; and (ii) the performance of a particular combination of imputation and clustering methods varies with dataset size. For example, the combination of single-cell analysis via expression recovery and Sparse Subspace Clustering (SSC) methods usually works well on smaller datasets, while the combination of adaptively-thresholded low-rank approximation and single-cell interpretation via multikernel learning (SIMLR) usually achieves the best performance on larger datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Evaluating the performance of dropout imputation and clustering methods for single-cell RNA sequencing data

Abstract

Talk to us

Similar Papers

More From: Computers in Biology and Medicine

Lead the way for us

Journal: Computers in Biology and Medicine	Publication Date: Jun 8, 2022
Citations: 14

Similar Papers

A Streamlined scRNA-Seq Data Analysis Framework Based on Improved Sparse Subspace Clustering
Jujuan Zhuang ... Tianqi Qu
IEEE Access | VOL. 9
Jujuan Zhuang, et. al.Jujuan Zhuang ... Tianqi Qu
01 Jan 2020
IEEE Access | VOL. 9

A data-driven clustering recommendation method for single-cell RNA-sequencing data
Yu Tian ... Min Li
Tsinghua Science and Technology | VOL. 26
Yu Tian, et. al.Yu Tian ... Min Li
22 Apr 2021
Tsinghua Science and Technology | VOL. 26

Subspace Learning by $$\ell ^{0}$$ ℓ 0 -Induced Sparsity
Yingzhen Yang ... Thomas S Huang
International Journal of Computer Vision | VOL. 126
Yingzhen Yang, et. al.Yingzhen Yang ... Thomas S Huang
17 Jul 2018
International Journal of Computer Vision | VOL. 126

A multi-stage approach to clustering and imputation of gene expression profiles
Dorothy S V Wong ... Frederick K Wong
Bioinformatics | VOL. 23
Dorothy S V Wong, et. al.Dorothy S V Wong ... Frederick K Wong
18 Feb 2007
Bioinformatics | VOL. 23

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Evaluating the performance of dropout imputation and clustering methods for single-cell RNA sequencing data

Abstract

Talk to us

Similar Papers

More From: Computers in Biology and Medicine