Velo-Predictor: an ensemble learning pipeline for RNA velocity prediction

Xin Wang, Jie Zheng

doi:10.1186/s12859-021-04330-1

Abstract

Background: RNA velocity is a novel and powerful concept which enables the inference of dynamical cell state changes from seemingly static single-cell RNA sequencing (scRNA-seq) data. However, accurate estimation of RNA velocity is still a challenging problem, and the underlying kinetic mechanisms of transcriptional and splicing regulations are not fully clear. Moreover, scRNA-seq data tend to be sparse compared with possible cell states, and a given dataset of estimated RNA velocities needs imputation for some cell states not yet covered.

Results: We formulate RNA velocity prediction as a supervised classification problem for the first time, where a cell state space is divided by direction into equal-sized segments that serve as classes, and the estimated RNA velocity vectors are treated as ground truth. We propose Velo-Predictor, an ensemble learning pipeline for predicting RNA velocities from scRNA-seq data. We test different models on two real datasets, and Velo-Predictor exhibits good performance, especially when XGBoost is used as the base predictor. Parameter analysis and visualization also show that the method is robust and able to make biologically meaningful predictions.

Conclusion: The accurate results show that Velo-Predictor can effectively simplify the procedure by learning a predictive model from gene expression data, which could help to construct a continuous landscape and give biologists an intuitive picture of the trend of cellular dynamics.
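The abstract describes turning velocity vectors into class labels by splitting the state space into equal-sized directional segments. The following is a minimal sketch of that idea, not the authors' implementation. It assumes a 2-D embedding, and the function name `direction_class` and the default of 8 classes are hypothetical choices made for illustration.

```python
import math


def direction_class(vx, vy, n_classes=8):
    """Map a 2-D velocity vector to one of n_classes equal angular sectors.

    Assumption: sector 0 starts at angle 0 (the positive x-axis) and sectors
    are numbered counter-clockwise. The paper may use another convention.
    """
    if vx == 0 and vy == 0:
        raise ValueError("a zero vector has no direction")
    # atan2 returns an angle in (-pi, pi]; shift it into [0, 2*pi).
    angle = math.atan2(vy, vx) % (2 * math.pi)
    width = 2 * math.pi / n_classes
    # The final modulo guards against rounding that lands exactly on 2*pi.
    return int(angle // width) % n_classes


# Illustrative (made-up) velocity vectors, one per quadrant.
labels = [direction_class(vx, vy, n_classes=4)
          for vx, vy in [(1, 1), (-1, 1), (-1, -1), (1, -1)]]
print(labels)
```

Labels produced this way could then serve as targets for any multi-class classifier trained on gene expression features.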

Highlights

RNA velocity is a novel and powerful concept which enables the inference of dynamical cell state changes from seemingly static single-cell RNA sequencing data
There are various approaches to trajectory reconstruction; for example, SCUBA [5] is based on bifurcation analysis, while SCENT [6] and scEpath [7] use an entropy measure of cell states.
$\frac{dS(t)}{dt} = \beta \cdot U(t) - \gamma \cdot S(t)$, where S(t) represents mature mRNA abundance over time, U(t) represents pre-mRNA abundance over time, α is the rate of transcription, β is the rate of splicing, and γ is the rate of degradation. k and t are cell-specific latent variables, where k represents discrete transcriptional state, and t represents latent time.
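As a small worked illustration of the splicing equation above, the sketch below evaluates the rate of change of mature mRNA, β · U(t) − γ · S(t), for a single gene in a single cell. The function name `spliced_velocity` and all numeric values are made up for illustration. In practice the rates are estimated from data rather than supplied by hand.

```python
def spliced_velocity(u, s, beta, gamma):
    """Return dS/dt = beta * u - gamma * s for one gene in one cell.

    u: pre-mRNA (unspliced) abundance
    s: mature mRNA (spliced) abundance
    beta: splicing rate, gamma: degradation rate
    """
    return beta * u - gamma * s


# Hypothetical rates and abundances, chosen only to show the sign behaviour.
beta, gamma = 1.0, 0.5

increasing = spliced_velocity(u=2.0, s=1.0, beta=beta, gamma=gamma)
steady = spliced_velocity(u=1.0, s=2.0, beta=beta, gamma=gamma)
decreasing = spliced_velocity(u=0.5, s=4.0, beta=beta, gamma=gamma)

print(increasing, steady, decreasing)
```

A positive value means mature mRNA is accumulating, a negative value means it is declining, and zero corresponds to the steady state where S = (β / γ) · U.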

Summary

Introduction

RNA velocity is a novel and powerful concept which enables the inference of dynamical cell state changes from seemingly static single-cell RNA sequencing (scRNA-seq) data. Accurate estimation of RNA velocity is still a challenging problem, and the underlying kinetic mechanisms of transcriptional and splicing regulations are not fully clear. scRNA-seq data tend to be sparse compared with possible cell states, and a given dataset of estimated RNA velocities needs imputation for some cell states not yet covered. Recent advances in high-throughput RNA sequencing technologies [1] have enabled analysis of transcription at the single-cell level [2], which has provided immense opportunities to unravel the underlying mechanisms of gene expression regulation. Trajectory inference (including pseudotime analysis) is a primary task for identifying cells in various states of differentiation [4]. HopLand [8] and Topslam [9] project cells onto a landscape with optimized parameters.



Journal: BMC Bioinformatics	Publication Date: May 1, 2021
Citations: 5	License type: open-access


Similar Papers

SIRV: spatial inference of RNA velocity at the single-cell resolution.
Tamim Abdelaal ... Ahmed Mahfouz
NAR genomics and bioinformatics | VOL. 6
02 Jul 2024

Inferring Time-Lagged Causality Using the Derivative of Single-Cell Expression.
Huanhuan Wei ... Hongyu Zhao
International journal of molecular sciences | VOL. 23
20 Mar 2022

Effectively Clustering Single Cell RNA Sequencing Data by Sparse Representation.
Rui-Yi Li ... Jihong Guan
IEEE/ACM Transactions on Computational Biology and Bioinformatics | VOL. 19
01 Nov 2022

Joint learning of multiple gene networks from single-cell gene expression data
Nuosi Wu ... Weixin Xie
Computational and Structural Biotechnology Journal | VOL. 18
01 Jan 2020
