Abstract

Various studies have linked several diseases, including cancer and COVID-19, to single nucleotide variations (SNV). Although single-cell RNA sequencing (scRNA-seq) technology can provide SNV and gene expression data, few studies have integrated and analyzed these multimodal data. To address this issue, we introduce Interpretable Single-cell Multimodal Data Integration Based on Variational Autoencoder (ISMI-VAE). ISMI-VAE leverages latent variable models that utilize the characteristics of SNV and gene expression data to overcome high noise levels and uses deep learning techniques to integrate multimodal information, map them to a low-dimensional space, and classify disease cells. Moreover, ISMI-VAE introduces an attention mechanism to reflect feature importance and analyze genetic features that could potentially cause disease. Experimental results on three cancer data sets and one COVID-19 data set demonstrate that ISMI-VAE surpasses the baseline method in terms of both effectiveness and interpretability and can effectively identify disease-causing gene features.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call