Scedar: A scalable Python package for single-cell RNA-seq exploratory data analysis.

Yuanchao Zhang,Erin R Reichenberger,Ben Stear,Man S Kim,Deanne M Taylor

doi:10.1371/journal.pcbi.1007794

Abstract

In single-cell RNA-seq (scRNA-seq) experiments, the number of individual cells has increased exponentially, and the sequencing depth of each cell has decreased significantly. As a result, analyzing scRNA-seq data requires extensive considerations of program efficiency and method selection. In order to reduce the complexity of scRNA-seq data analysis, we present scedar, a scalable Python package for scRNA-seq exploratory data analysis. The package provides a convenient and reliable interface for performing visualization, imputation of gene dropouts, detection of rare transcriptomic profiles, and clustering on large-scale scRNA-seq datasets. The analytical methods are efficient, and they also do not assume that the data follow certain statistical distributions. The package is extensible and modular, which would facilitate the further development of functionalities for future requirements with the open-source development community. The scedar package is distributed under the terms of the MIT license at https://pypi.org/project/scedar.

Highlights

Cost-effective large-scale transcriptomic profiling of individual cells is enabled by the development of microfluidic, nanodroplet, and massively parallel sequencing technologies
We illustrate the basic workflow of using scedar for scRNA-seq exploratory data analysis with the dataset published by Zeisel et al [53] (Fig 1)
The biological results of scRNA-seq experiments are obtained from extensive data analyses, which could take more time than doing the experiments

Summary

Introduction

Cost-effective large-scale transcriptomic profiling of individual cells is enabled by the development of microfluidic, nanodroplet, and massively parallel sequencing technologies. Using these technologies, single-cell RNA-seq (scRNA-seq) experiments usually generate transcriptomic profiles of thousands to millions of individual cells [1]. Analyses of scRNA-seq datasets require efficient computational programs and sophisticated statistical methods. The programs should be able to manage memory efficiently, exploit multiple cores of the processing units, and handle errors and exceptions gracefully. The statistical methods must be able to function against high dimensionality, low signal-to-noise ratio, and different characteristics of data generated from different technologies and protocols

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLOS Computational Biology	Publication Date: Apr 27, 2020
Citations: 11	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Scedar: A scalable Python package for single-cell RNA-seq exploratory data analysis.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS Computational Biology

Lead the way for us

Similar Papers

Systematic determination of the mitochondrial proportion in human and mice tissues for single-cell RNA-sequencing data quality control.
Daniel Osorio ... James J Cai
Bioinformatics | VOL. 37
Daniel Osorio, et. al.Daniel Osorio ... James J Cai
25 Aug 2020
Bioinformatics | VOL. 37

Evaluation of deep learning-based feature selection for single-cell RNA sequencing data analysis
Hao Huang ... Pengyi Yang
Genome biology | VOL. 24
Hao Huang, et. al.Hao Huang ... Pengyi Yang
10 Nov 2023
Genome biology | VOL. 24

Data Analysis in Single-Cell Transcriptome Sequencing.
Shan Gao
Methods in molecular biology (Clifton, N.J.) | VOL. 1754
Shan GaoShan Gao
01 Jan 2018
Methods in molecular biology (Clifton, N.J.) | VOL. 1754

PopsicleR: A R Package for Pre-processing and Quality Control Analysis of Single Cell RNA-seq Data
Francesco Grandi ... Silvio Bicciato
Journal of Molecular Biology | VOL. 434
Francesco Grandi, et. al.Francesco Grandi ... Silvio Bicciato
24 Mar 2022
Journal of Molecular Biology | VOL. 434

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Scedar: A scalable Python package for single-cell RNA-seq exploratory data analysis.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS Computational Biology