Abstract

Information geometry studies the dually flat structure of a manifold, highlighted by the generalized Pythagorean theorem. The present paper studies a class of Bregman divergences called the (ρ,τ)-divergence. A (ρ,τ) -divergence generates a dually flat structure in the manifold of positive measures, as well as in the manifold of positive-definite matrices. The class is composed of decomposable divergences, which are written as a sum of componentwise divergences. Conversely, a decomposable dually flat divergence is shown to be a (ρ,τ) -divergence. A (ρ,τ) -divergence is determined from two monotone scalar functions, ρ and τ. The class includes the KL-divergence, α-, β- and (α, β)-divergences as special cases. The transformation between an affine parameter and its dual is easily calculated in the case of a decomposable divergence. Therefore, such a divergence is useful for obtaining the center for a cluster of points, which will be applied to classification and information retrieval in vision. For the manifold of positive-definite matrices, in addition to the dually flatness and decomposability, we require the invariance under linear transformations, in particular under orthogonal transformations. This opens a way to define a new class of divergences, called the (ρ,τ) -structure in the manifold of positive-definite matrices.

Highlights

  • Information geometry, originated from the invariant structure of a manifold of probability distributions, consists of a Riemannian metric and dually coupled affine connections with respect to Entropy 2014, 16 the metric [1]

  • This is different from the case of decomposable divergence, where any monotone pair of ρ(ξ) and τ (ξ) gives a dually flat structure

  • We extended it to the manifold of positive-definite matrices, where the criterion of invariance under linear transformations were added

Read more

Summary

Introduction

Information geometry, originated from the invariant structure of a manifold of probability distributions, consists of a Riemannian metric and dually coupled affine connections with respect to Entropy 2014, 16 the metric [1]. This is the (ρ, τ )-divergences, introduced by Zhang [5,6], from the point of view of “representation duality” They are Bregman divergences generating a dually flat structure. We analyze the cases when a (ρ, τ )-divergence is invariant under the general linear transformations, Gl(n), and invariant under the orthogonal transformations, O(n) They include many well-known divergences of PD matrices, and give new important divergences. It defines the cluster center due to a divergence. It gives dually flat decomposable affine coordinates and a related canonical divergence (Bregman divergence).

Dually Flat Manifold
Decomposable Divergence
Cluster Center
Convex Functions
Various Examples
Invariant and Decomposable Convex Function
Non-Decomposable Divergence
Non-Decomposable Flat Divergence in P Dn
Flat Structure Derived from q-Escort Distribution
Concluding Remarks
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call