Abstract

Single-cell RNA sequencing (scRNA-seq) technology has revolutionized biological research by enabling the sequencing of mRNA in individual cells, thereby providing valuable insights into cellular gene expression and functions. However, scRNA-seq data often contain false zero values known as dropout events, which can obscure true gene expression levels and compromise downstream analysis accuracy. To address this issue, several computational approaches have been proposed for imputing missing gene expression values. Nevertheless, these methods struggle to capture dropout value distributions due to the sparsity of scRNA-seq data and complex gene expression patterns. In this study, we present a novel method called scIDPMs that utilizes conditional diffusion probabilistic models to impute scRNA-seq data. Firstly, scIDPMs identifies dropout sites based on gene expression characteristics and subsequently infers the missing values by considering available gene expression information. To effectively capture global gene expression features, scIDPMs employs a deep neural network with an attention mechanism to optimize the imputation process. We evaluated the performance of scIDPMs using simulated and real scRNA-seq datasets and compared it with ten other imputation methods. The results indicate that scIDPMs outperform other methods in restoring biologically meaningful gene expression values and improving downstream analysis.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call