Lumáwig: An Efficient Algorithm for Dimension Zero Bottleneck Distance Computation in Topological Data Analysis

Paul Samuel Ignacio,Jay-Anne Bulauan,David Uminsky

doi:10.3390/a13110291

Paul Samuel Ignacio, Jay-Anne Bulauan + Show 1 more

Open Access

https://doi.org/10.3390/a13110291

Copy DOI

Abstract

Stability of persistence diagrams under slight perturbations is a key characteristic behind the validity and growing popularity of topological data analysis in exploring real-world data. Central to this stability is the use of Bottleneck distance which entails matching points between diagrams. Instances of use of this metric in practical studies have, however, been few and sparingly far between because of the computational obstruction, especially in dimension zero where the computational cost explodes with the growth of data size. We present a novel efficient algorithm to compute dimension zero bottleneck distance between two persistent diagrams of a specific kind which runs significantly faster and provides significantly sharper approximates with respect to the output of the original algorithm than any other available algorithm. We bypass the overwhelming matching problem in previous implementations of the bottleneck distance, and prove that the zero dimensional bottleneck distance can be recovered from a very small number of matching cases. Partly in keeping with nomenclature traditions in this area of TDA, we name this algorithm Lumáwig as a nod to a deity in the northern Philippines, where the algorithm was developed. We show that Lumáwig generally enjoys linear complexity as shown by empirical tests. We also present an application that leverages dimension zero persistence diagrams and the bottleneck distance to produce features for classification tasks.

Highlights

Topological data analysis (TDA) has gathered significant interest from a wide range of researchers because of its novel approach and use of classical tools from algebraic topology for extracting descriptive features from data
By considering dimension 0 persistence diagrams induced from the Rips filtration, we can approach the problem via a different framework, birthing a new efficient algorithm for computing the bottleneck distance
To further investigate the observations above, we examine the performance of L UMÁWIG R in the computation of dimension zero bottleneck distance in four pairs of settings for size of the diagrams and the range of values the death times are drawn from

Summary

Introduction

Topological data analysis (TDA) has gathered significant interest from a wide range of researchers because of its novel approach and use of classical tools from algebraic topology for extracting descriptive features from data. They augment the Hopcroft-Karp algorithm [15] by appealing to a near-neighbor data structure (a k-d tree) to search for the best candidate pair for a query point, pruning from the search the subtrees (and all other candidates within them) whose enclosing box is further away from the query than the current best candidate This circumvents the overwhelming matching problem by significantly shrinking down the combination pool to retrieve the best matching. By considering dimension 0 persistence diagrams induced from the Rips filtration, we can approach the problem via a different framework, birthing a new efficient algorithm for computing the bottleneck distance. As a proof of concept, we use L UMÁWIG to generate features for the classification of digit images from the MNIST data set

Bypassing Matchings

Benchmarking

Benchmarking against All Available Algorithms

Benchmarking L UMÁWIG on Larger Data Sets

Complexity Analysis

L UMÁWIG in Digit Classification

Discussions and Conclusions

Findings

Repository for L UMÁWIG

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Algorithms	Publication Date: Nov 11, 2020
Citations: 3	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Lumáwig: An Efficient Algorithm for Dimension Zero Bottleneck Distance Computation in Topological Data Analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Algorithms

Lead the way for us

Similar Papers

Symmetric functions for fast image retrieval with persistent homology
Alessia Angeli ... Ivan Tomba
Mathematical Methods in the Applied Sciences | VOL. 41
Alessia Angeli, et. al.Alessia Angeli ... Ivan Tomba
16 Oct 2018
Mathematical Methods in the Applied Sciences | VOL. 41

Embeddings of persistence diagrams into Hilbert spaces
Peter Bubenik ... Alexander Wagner
Journal of Applied and Computational Topology | VOL. 4
Peter Bubenik, et. al.Peter Bubenik ... Alexander Wagner
26 Jun 2020
Journal of Applied and Computational Topology | VOL. 4

Persistence-Based Pooling for Shape Pose Recognition
Thomas Bonis ... Frédéric Chazal
-
Thomas Bonis, et. al.Thomas Bonis ... Frédéric Chazal
01 Jan 2015
01 Jan 2015

Using Topological Data Analysis (TDA) and Persistent Homology to Analyze the Stock Markets in Singapore and Taiwan
Peter Tsung-Wen Yen ... Siew Ann Cheong
Frontiers in Physics | VOL. 9
Peter Tsung-Wen Yen, et. al.Peter Tsung-Wen Yen ... Siew Ann Cheong
04 Mar 2021
Frontiers in Physics | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Lumáwig: An Efficient Algorithm for Dimension Zero Bottleneck Distance Computation in Topological Data Analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Algorithms