A multitask clustering approach for single-cell RNA-seq analysis in Recessive Dystrophic Epidermolysis Bullosa.

Huanan Zhang,Raphael Petegrosso,Catherine A A Lee,John R Garbe,Jakub Tolar,Zhuliu Li,Cindy R Eide,Rui Kuang

doi:10.1371/journal.pcbi.1006053

Huanan Zhang, Raphael Petegrosso + Show 6 more

Open Access

https://doi.org/10.1371/journal.pcbi.1006053

Copy DOI

Abstract

Single-cell RNA sequencing (scRNA-seq) has been widely applied to discover new cell types by detecting sub-populations in a heterogeneous group of cells. Since scRNA-seq experiments have lower read coverage/tag counts and introduce more technical biases compared to bulk RNA-seq experiments, the limited number of sampled cells combined with the experimental biases and other dataset specific variations presents a challenge to cross-dataset analysis and discovery of relevant biological variations across multiple cell populations. In this paper, we introduce a method of variance-driven multitask clustering of single-cell RNA-seq data (scVDMC) that utilizes multiple single-cell populations from biological replicates or different samples. scVDMC clusters single cells in multiple scRNA-seq experiments of similar cell types and markers but varying expression patterns such that the scRNA-seq data are better integrated than typical pooled analyses which only increase the sample size. By controlling the variance among the cell clusters within each dataset and across all the datasets, scVDMC detects cell sub-populations in each individual experiment with shared cell-type markers but varying cluster centers among all the experiments. Applied to two real scRNA-seq datasets with several replicates and one large-scale droplet-based dataset on three patient samples, scVDMC more accurately detected cell populations and known cell markers than pooled clustering and other recently proposed scRNA-seq clustering methods. In the case study applied to in-house Recessive Dystrophic Epidermolysis Bullosa (RDEB) scRNA-seq data, scVDMC revealed several new cell types and unknown markers validated by flow cytometry. MATLAB/Octave code available at https://github.com/kuanglab/scVDMC.

Highlights

In recent years, single-cell RNA sequencing has emerged as the dominant method for quantifying transcriptome-wide mRNA expression in individual cells
We have developed a multitask clustering method to address the cross-population clustering problem
We demonstrate that our multitask clustering method significantly improves clustering accuracy and marker discovery in three public scRNA-seq datasets and apply the method to an in-house Recessive Dystrophic Epidermolysis Bullosa (RDEB) dataset

Summary

Introduction

Single-cell RNA sequencing (scRNA-seq) has emerged as the dominant method for quantifying transcriptome-wide mRNA expression in individual cells. In addition to the noise and bias that exist in bulk RNAseq experiments, issues unique to scRNA-seq include those from biological sources, such as cell-cycle stage or cell size, as well as from technical/systematic sources, such as capture inefficiency, material degradation, sample contamination, amplification biases, GC content, and sequencing depth. These experimental biases and limitations cause uneven coverage of the entire transcriptome and result in an abundance of zero-coverage regions [3, 4]. When the scRNA-seq profiles from multiple patients are pooled together for clustering, the clusters will highly overlap with the division of the single cells by the sample origins rather than similar types such as pathogenic cells vs normal cells

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLOS Computational Biology	Publication Date: Apr 9, 2018
Citations: 33	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A multitask clustering approach for single-cell RNA-seq analysis in Recessive Dystrophic Epidermolysis Bullosa.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS Computational Biology

Lead the way for us

Similar Papers

Author response: High-resolution transcriptional and morphogenetic profiling of cells from micropatterned human ESC gastruloid cultures
Kyaw Thu Minn ... Samantha A Morris
-
Kyaw Thu Minn, et. al.Kyaw Thu Minn ... Samantha A Morris
04 Nov 2020
04 Nov 2020

Decision letter: Deconstructing cold-induced brown adipocyte neogenesis in mice
Patrick Seale ... Carlos Isales
-
Patrick Seale, et. al.Patrick Seale ... Carlos Isales
25 Jun 2022
25 Jun 2022

Decision letter: The single-cell chromatin accessibility landscape in mouse perinatal testis development
Deborah Bourc'his ... Marianne E Bronner
-
Deborah Bourc'his, et. al.Deborah Bourc'his ... Marianne E Bronner
31 Jan 2022
31 Jan 2022

Decision letter: Single-cell RNA sequencing of the Strongylocentrotus purpuratus larva reveals the blueprint of major cell types and nervous system of a non-chordate deuterostome
Pedro Martinez Serra ... Marianne E Bronner
-
Pedro Martinez Serra, et. al.Pedro Martinez Serra ... Marianne E Bronner
06 Jul 2021
06 Jul 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A multitask clustering approach for single-cell RNA-seq analysis in Recessive Dystrophic Epidermolysis Bullosa.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS Computational Biology