Structure-preserving visualisation of high dimensional single-cell datasets

Benjamin Szubert,Claudia Monaco,Jennifer E Cole,Ignat Drozdov

doi:10.1038/s41598-019-45301-0

Benjamin Szubert, Claudia Monaco + Show 2 more

Open Access

https://doi.org/10.1038/s41598-019-45301-0

Copy DOI

Abstract

Single-cell technologies offer an unprecedented opportunity to effectively characterize cellular heterogeneity in health and disease. Nevertheless, visualisation and interpretation of these multi-dimensional datasets remains a challenge. We present a novel framework, ivis, for dimensionality reduction of single-cell expression data. ivis utilizes a siamese neural network architecture that is trained using a novel triplet loss function. Results on simulated and real datasets demonstrate that ivis preserves global data structures in a low-dimensional space, adds new data points to existing embeddings using a parametric mapping function, and scales linearly to hundreds of thousands of cells. ivis is made publicly available through Python and R interfaces on https://github.com/beringresearch/ivis.

Highlights

Characterising cellular composition is crucial for defining functional heterogeneity in health and disease[1]
Visualisation and interpretation of single-cell experiments are underpinned by dimensionality reduction (DR) techniques
Unsupervised Neural Network (NN) with multiple layers are trained by optimizing a target function, whilst an intermediate layer with small cardinality serves as a low dimensional representation of the input data[19,21]

Summary

Introduction

Characterising cellular composition is crucial for defining functional heterogeneity in health and disease[1]. Non-linear approaches, including the t-distributed Stochastic Neighbor Embedding (t-SNE) algorithm[11], have been shown to effectively capture complex data structures, outperforming linear projection methods such as Principal Component Analysis (PCA)[12,13] t-SNE has several limitations[14,15]. Due to non-parametric nature of t-SNE, addition of new data points to existing embeddings is not possible[11,15]. In this paper we introduce a scalable algorithm, ivis, which effectively captures local as well as global features of high-dimensional datasets. Ivis learns a parametric mapping from the high-dimensional space to low-dimensional embedding, facilitating seamless addition of new data points to the mapping function. We demonstrate that ivis preserves distances in low-dimensional projections, enabling biological interpretation. We validate our method using synthetic, cytometry by time of flight (CyTOF), and scRNA-seq datasets

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Reports	Publication Date: Jun 20, 2019
Citations: 67	License type: open-access

R Discovery Prime

R Discovery Prime

Structure-preserving visualisation of high dimensional single-cell datasets

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

A Tool for Visualization and Analysis of Single-Cell RNA-Seq Data Based on Text Mining
Gennaro Gambardella ... Diego Di Bernardo
Frontiers in Genetics | VOL. 10
Gennaro Gambardella, et. al.Gennaro Gambardella ... Diego Di Bernardo
09 Aug 2019
Frontiers in Genetics | VOL. 10

Design and Implementation of Authentication System Using Deep Convoluted Siamese Network
Sumagna Dey ... Indrajit Das
-
Sumagna Dey, et. al.Sumagna Dey ... Indrajit Das
26 Feb 2022
26 Feb 2022

One-shot learning with triplet loss for vegetation classification tasks
A.V Uzhinskiy ... G.A Ososkov
Computer Optics | VOL. 45
A.V Uzhinskiy, et. al.A.V Uzhinskiy ... G.A Ososkov
01 Aug 2021
Computer Optics | VOL. 45

Analysis of Few-Shot Techniques for Fungal Plant Disease Classification and Evaluation of Clustering Capabilities Over Real Datasets.
Itziar Egusquiza ... Artzai Picon
Frontiers in Plant Science | VOL. 13
Itziar Egusquiza, et. al.Itziar Egusquiza ... Artzai Picon
07 Mar 2022
Frontiers in Plant Science | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Structure-preserving visualisation of high dimensional single-cell datasets

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports