Abstract

• A novel foundation for topological pattern recognition of point cloud data, through persistent homology of graph approximations. • An extension of persistent homology to point cloud data of metric trees, including theoretical and experimental verification. • The first charting of cell trajectory data sets that explains some of the difficulties this field is confronted with. The rising field of Topological Data Analysis (TDA) provides a new approach to learning from data through persistence diagrams , which are topological signatures that quantify topological properties of data in a comparable manner. For point clouds, these diagrams are often derived from the Vietoris-Rips filtration—based on the metric equipped on the data—which allows one to deduce topological patterns such as components and cycles of the underlying space. In metric trees these diagrams often fail to capture other crucial topological properties, such as the present leaves and multifurcations. Prior methods and results for persistent homology attempting to overcome this issue mainly target Rips graphs, which are often unfavorable in case of non-uniform density across our point cloud. We therefore introduce a new theoretical foundation for learning a wider variety of topological patterns through any given graph . Given particular powerful functions defining persistence diagrams to summarize topological patterns, including the normalized centrality or eccentricity , we prove a new stability result, explicitly bounding the bottleneck distance between the true and empirical diagrams for metric trees. This bound is tight if the metric distortion obtained through the graph and its maximal edge-weight are small. Through a case study of gene expression data, we demonstrate that our newly introduced diagrams provide novel quality measures and insights into cell trajectory inference.

Highlights

  • Persistent homology [16]—the most prominently used and studied tool within the field of Topological Data Analysis (TDA) [6]—has led to many new applications to supervised and unsupervised machine learning

  • We show under which conditions functions lead to a nontrivial stability result—guaranteeing that our true and empirical persistence diagram are close—for graph approximations (Theorems 2.1 and 2.2)

  • The concept of persistent homology has its roots in the field of algebraic topology [18]

Read more

Summary

Introduction

Persistent homology [16]—the most prominently used and studied tool within the field of Topological Data Analysis (TDA) [6]—has led to many new applications to supervised and unsupervised machine learning. Many of the data sets to which persistent homology has been successfully applied, were already at least partially structured, in the form of a simplicial complex, i.e., a higher-dimensional generalization of a graph. The underlying topology is often difficult to reveal, due to the high dimensionality of the data, or noise Since they lack a naturally induced simplicial structure, computing persistent homology of point clouds is mostly feasible through the the Vietoris-Rips filtration [22]. This type of persistence—a measure of prominence or relevance of a topological feature—is often insufficient, as it merely detects gaps, cycles, voids, and higherdimensional holes in the model. As will be shown in this paper, they allow us to learn a wider variety of topological patterns in metric trees, in theory and in practice

Contributions
Background on persistent homology
Related work
Persistent homology through graph approximations
Stability through graph approximations
A new stability result for metric trees
Experiments
Synthetic data of metric trees
Cell trajectory data
Discussion and conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call