Abstract

Methods of topological data analysis have been successfully applied in a wide range of fields to provide useful summaries of the structure of complex data sets in terms of topological descriptors, such as persistence diagrams. While there are many powerful techniques for computing topological descriptors, the inverse problem, i.e., recovering the input data from topological descriptors, has proved to be challenging. In this article, we study in detail the Topological Morphology Descriptor (TMD), which assigns a persistence diagram to any tree embedded in Euclidean space, and a sort of stochastic inverse to the TMD, the Topological Neuron Synthesis (TNS) algorithm, gaining both theoretical and computational insights into the relation between the two. We propose a new approach to classify barcodes using symmetric groups, which provides a concrete language to formulate our results. We investigate to what extent the TNS recovers a geometric tree from its TMD and describe the effect of different types of noise on the process of tree generation from persistence diagrams. We prove moreover that the TNS algorithm is stable with respect to specific types of noise.

Highlights

  • Geometric approaches to analyzing data have been extensively used for many years, the first topological methods for data analysis were developed only recently, e.g., [1,2,3,4,5,6].Topological Data Analysis (TDA) is a fairly new field at the intersection of data science and algebraic topology, the aim of which is to provide robust mathematical, statistical, and algorithmic methods to infer and analyze the topological and geometric structures underlying complex data

  • Motivated by the desire to objectively classify neuronal morphologies, in a previous publication (Kanari and Hess in [13]), we designed a topological signature for trees, the Topological Morphology Descriptor (TMD), that assigns a barcode to any geometric tree

  • The TMD (Topological Morphology Descriptor) is a many-to-one function from the set of geometric trees to the set of barcodes, TMD ∶ T → B, that encodes the overall shape of the tree, both the topology of the branching structure of a tree and its embedding in R3 [13]

Read more

Summary

Introduction

Geometric approaches to analyzing data have been extensively used for many years, the first topological methods for data analysis were developed only recently, e.g., [1,2,3,4,5,6]. Topological Data Analysis (TDA) is a fairly new field at the intersection of data science and algebraic topology, the aim of which is to provide robust mathematical, statistical, and algorithmic methods to infer and analyze the topological and geometric structures underlying complex data. That the probability that two bars of a barcode B will be permuted by applying TMD ○ TNS decreases exponentially with the distance between the terminations of the two bars, which is another form of stability Together, these stability results imply that the TNS is an excellent approximation to a (right) inverse to the TMD. The trees T and T ′ can be quite different combinatorially, as seen on the right

Mathematical Background
Barcodes
The TMD
The TNS
Elongation
The Elder Rule and TNS
Realizing Barcodes as Trees
The Combinatorics of Tree-Realization
Stability of the TNS
Bottleneck Stability
Transposition Stability
Computational Exploration of Tree-Realization
The Distribution of Tree-Realization Numbers
Empirical Distributions of Combinatorial Types of Trees
Diversity of Realized TMD-Equivalence Classes
Statistics of Changing Classes
Tree-Realizations of Biological Barcodes
Discussion
Objective
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.