Convex Divergence Research Articles

H-theorem states that the entropy production is nonnegative and, therefore, the entropy of a closed system should monotonically change in time. In information processing, the entropy production is positive for random transformation of signals (the information processing lemma). Originally, the H-theorem and the information processing lemma were proved for the classical Boltzmann-Gibbs-Shannon entropy and for the correspondent divergence (the relative entropy). Many new entropies and divergences have been proposed during last decades and for all of them the H-theorem is needed. This note proposes a simple and general criterion to check whether the H-theorem is valid for a convex divergence H and demonstrates that some of the popular divergences obey no H-theorem. We consider systems with n states Ai that obey first order kinetics (master equation). A convex function H is a Lyapunov function for all master equations with given equilibrium if and only if its conditional minima properly describe the equilibria of pair transitions Ai ⇌ Aj . This theorem does not depend on the principle of detailed balance and is valid for general Markov kinetics. Elementary analysis of pair equilibria demonstrate that the popular Bregman divergences like Euclidian distance or Itakura-Saito distance in the space of distribution cannot be the universal Lyapunov functions for the first-order kinetics and can increase in Markov processes. Therefore, they violate the second law and the information processing lemma. In particular, for these measures of information (divergences) random manipulation with data may add information to data. The main results are extended to nonlinear generalized mass action law kinetic equations.

This paper introduces scaled Bregman distances of probability distributions which admit nonuniform contributions of observed events. They are introduced in a general form covering not only the distances of discrete and continuous stochastic observations, but also the distances of random processes and signals. It is shown that the scaled Bregman distances extend not only the classical ones studied in the previous literature, but also the information divergence and the related wider class of convex divergences of probability measures. An information-processing theorem is established too, but only in the sense of invariance w.r.t. statistically sufficient transformations and not in the sense of universal monotonicity. Pathological situations where coding can increase the classical Bregman distance are illustrated by a concrete example. In addition to the classical areas of application of the Bregman distances and convex divergences such as recognition, classification, learning, and evaluation of proximity of various features and signals, the paper mentions a new application in 3-D exploratory data analysis. Explicit expressions for the scaled Bregman distances are obtained in general exponential families, with concrete applications in the binomial, Poisson, and Rayleigh families, and in the families of exponential processes such as the Poisson and diffusion processes including the classical examples of the Wiener process and geometric Brownian motion.

Convex Divergence Research Articles

Articles published on Convex Divergence

Martingale Methods for Sequential Estimation of Convex Functionals and Divergences

Trajectorial dissipation and gradient flow for the relative entropy in Markov chains

Strongly Convex Divergences.

Design and analysis of a high accuracy bidirectional thermal deformable mirror

Stronger Convergence Results for the Center-Based Fuzzy Clustering With Convex Divergence Measure.

Unsupervised Learning based Modified C- ICA for Audio Source Separation in Blind Scenario

Empirical Phi-discrepancies and quasi-empirical likelihood: exponential bounds

General H-theorem and Entropies that Violate the Second Law

On Bregman Distances and Divergences of Probability Measures

Convex Divergence ICA for Blind Source Separation

The α-EM algorithm: surrogate likelihood maximization using α-logarithmic information measures

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Convex Divergence Research Articles

Articles published on Convex Divergence

Martingale Methods for Sequential Estimation of Convex Functionals and Divergences

Trajectorial dissipation and gradient flow for the relative entropy in Markov chains

Strongly Convex Divergences.

Design and analysis of a high accuracy bidirectional thermal deformable mirror

Stronger Convergence Results for the Center-Based Fuzzy Clustering With Convex Divergence Measure.

Unsupervised Learning based Modified C- ICA for Audio Source Separation in Blind Scenario

Empirical Phi-discrepancies and quasi-empirical likelihood: exponential bounds

General H-theorem and Entropies that Violate the Second Law

On Bregman Distances and Divergences of Probability Measures

Convex Divergence ICA for Blind Source Separation

The α-EM algorithm: surrogate likelihood maximization using α-logarithmic information measures