Topological data analysis in information space

Herbert Edelsbrunner, Žiga Virk, Hubert Wagner

doi:10.20382/jocg.v11i2a7

Journal: Journal of Computational Geometry

Publication Date: Jan 1, 2019

Affiliation: Institute of Science and Technology Austria, University of Ljubljana

#Standard Euclidean Distance #Information-theoretic Justification

Abstract

Various kinds of data are routinely represented as discrete probability distributions. Examples include text documents summarized by histograms of word occurrences and images represented as histograms of oriented gradients. Viewing a discrete probability distribution as a point in the standard simplex of the appropriate dimension, we can understand collections of such objects in geometric and topological terms. Importantly, instead of using the standard Euclidean distance, we look into dissimilarity measures with information-theoretic justification, and we develop the theory needed for applying topological data analysis in this setting. In doing so, we emphasize constructions that enable the usage of existing computational topology software in this context.
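
To make the setup in the abstract concrete, the following is a minimal sketch of how two short text documents could be turned into points of the standard simplex and then compared. The abstract does not name a specific dissimilarity, so the Kullback-Leibler divergence and the Jensen-Shannon divergence used here are illustrative assumptions, as are the helper names `to_simplex`, `kl_divergence`, `jensen_shannon`, and `euclidean` and the two toy documents.

```python
import math
from collections import Counter


def to_simplex(counts, vocabulary):
    """Normalize raw word counts into a probability vector over a fixed vocabulary."""
    total = sum(counts.get(word, 0) for word in vocabulary)
    if total == 0:
        raise ValueError("document contains no words from the vocabulary")
    return [counts.get(word, 0) / total for word in vocabulary]


def kl_divergence(p, q):
    """Kullback-Leibler divergence KL(p || q) in bits (illustrative choice).

    Returns infinity when q assigns zero mass to an outcome that p supports.
    """
    total = 0.0
    for pi, qi in zip(p, q):
        if pi == 0:
            continue  # the term 0 * log(0 / q) is taken to be 0
        if qi == 0:
            return math.inf
        total += pi * math.log2(pi / qi)
    return total


def jensen_shannon(p, q):
    """Jensen-Shannon divergence in bits: symmetric and always finite."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)


def euclidean(p, q):
    """Standard Euclidean distance, shown only for comparison."""
    return math.sqrt(sum((pi - qi) ** 2 for pi, qi in zip(p, q)))


# Hypothetical toy documents, summarized by histograms of word occurrences.
doc_a = Counter("the cat sat on the mat".split())
doc_b = Counter("the dog sat on the log".split())
vocabulary = sorted(set(doc_a) | set(doc_b))

p = to_simplex(doc_a, vocabulary)
q = to_simplex(doc_b, vocabulary)

print("Euclidean:     ", euclidean(p, q))
print("KL(p || q):    ", kl_divergence(p, q))
print("Jensen-Shannon:", jensen_shannon(p, q))
```

Because each document uses words the other lacks, the Kullback-Leibler divergence is infinite in this example, while the Jensen-Shannon divergence stays finite. Asymmetry and unbounded values of this kind are among the reasons such measures cannot simply be substituted for the Euclidean distance in an existing pipeline.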

Full Text