Abstract

The Kullback–Leibler (KL) divergence is a fundamental measure of information geometry that is used in a variety of contexts in artificial intelligence. We show that, when system dynamics are given by distributed nonlinear systems, this measure can be decomposed as a function of two information-theoretic measures, transfer entropy and stochastic interaction. More specifically, these measures are applicable when selecting a candidate model for a distributed system, where individual subsystems are coupled via latent variables and observed through a filter. We represent this model as a directed acyclic graph (DAG) that characterises the unidirectional coupling between subsystems. Standard approaches to structure learning are not applicable in this framework due to the hidden variables; however, we can exploit the properties of certain dynamical systems to formulate exact methods based on differential topology. We approach the problem by using reconstruction theorems to derive an analytical expression for the KL divergence of a candidate DAG from the observed dataset. Using this result, we present a scoring function based on transfer entropy to be used as a subroutine in a structure learning algorithm. We then demonstrate its use in recovering the structure of coupled Lorenz and Rössler systems.
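For readers unfamiliar with the two measures named above, the block below gives their standard definitions: Schreiber's transfer entropy and Ay's stochastic interaction. The paper's own notation, history lengths, and conditioning sets may differ; this is reference material, not the paper's derivation.

```latex
% Transfer entropy from a source Y to a target X, with embedded
% histories X_t^{(k)} and Y_t^{(l)} (Schreiber's definition):
T_{Y \to X} = I\big(X_{t+1};\, Y_t^{(l)} \,\big|\, X_t^{(k)}\big)
            = \sum p\big(x_{t+1}, x_t^{(k)}, y_t^{(l)}\big)
              \log \frac{p\big(x_{t+1} \mid x_t^{(k)}, y_t^{(l)}\big)}
                        {p\big(x_{t+1} \mid x_t^{(k)}\big)}

% Stochastic interaction of a multivariate process X = (X^1, ..., X^N):
% the excess of the subsystems' marginal transition uncertainties over
% the joint transition uncertainty (Ay's definition):
S(X) = \sum_{i=1}^{N} H\big(X_{t+1}^{i} \mid X_t^{i}\big) - H\big(X_{t+1} \mid X_t\big)
```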

Highlights

  • Distributed information processing systems are commonly studied in complex systems and machine learning research

  • Such systems are the subject of multidisciplinary study in fields such as ecology [4], neuroscience [5,6], multi-agent systems [7,8,9], and various others that focus on artificial and biological networks [10]. We represent such a spatially distributed system as a probabilistic graphical model termed a synchronous graph dynamical system (GDS) [11,12], whose structure is given by a directed acyclic graph (DAG)

  • We have presented a principled method to compute the KL divergence for model selection in distributed dynamical systems based on concepts from differential topology

Summary

Introduction

Distributed information processing systems are commonly studied in complex systems and machine learning research. They are the subject of multidisciplinary study in fields such as ecology [4], neuroscience [5,6], multi-agent systems [7,8,9], and various others that focus on artificial and biological networks [10]. We represent such a spatially distributed system as a probabilistic graphical model termed a synchronous graph dynamical system (GDS) [11,12], whose structure is given by a directed acyclic graph (DAG). We draw on state space reconstruction methods from differential topology to reformulate the KL divergence in terms of computable distributions. Using this expression, we show that the maximum transfer entropy graph is the most likely to have generated the data.
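To make the maximum-transfer-entropy principle concrete, here is a minimal sketch (our illustration, not the authors' implementation): it scores every directed pair of observed time series by transfer entropy under a jointly Gaussian approximation with one-step histories, and keeps the highest-scoring source for each target.

```python
import numpy as np

def gaussian_cmi(x, y, z):
    """I(X; Y | Z) in nats for jointly Gaussian samples, computed from
    log-determinants of the relevant covariance matrices."""
    def logdet(*cols):
        cov = np.atleast_2d(np.cov(np.column_stack(cols), rowvar=False))
        return np.linalg.slogdet(cov)[1]
    return 0.5 * (logdet(x, z) + logdet(y, z) - logdet(z) - logdet(x, y, z))

def transfer_entropy(source, target):
    """TE_{source -> target} = I(X_{t+1}; Y_t | X_t), one-step histories."""
    return gaussian_cmi(target[1:], source[:-1], target[:-1])

def max_te_parents(series):
    """series: (T, N) array of N observed time series.
    Returns, for each target i, the source j maximising TE_{j -> i}."""
    n = series.shape[1]
    return {i: max((j for j in range(n) if j != i),
                   key=lambda j: transfer_entropy(series[:, j], series[:, i]))
            for i in range(n)}
```

This sketch omits the state-space embedding of latent variables and the independence-test penalty that the paper develops; it only illustrates how a transfer entropy score can drive structure selection.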

Related Work
Notation
Representing Distributed Dynamical Systems as Probabilistic Graphical Models
Network Scoring Functions
Computing Conditional KL Divergence
A Tractable Expression via Embedding Theory
Information-Theoretic Interpretation
Application to Structure Learning
Penalising Transfer Entropy by Independence Tests
Implementation Details and Algorithm Analysis
Experimental Validation
Distributed Lorenz and Rössler Attractors (see the simulation sketch after this outline)
Case Study
Findings
Discussion and Future Work
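The experimental sections above validate the method on coupled chaotic systems. As an illustration of the kind of data involved, the sketch below integrates two Lorenz systems with unidirectional coupling from the first to the second; the diffusive coupling through the x coordinate and all parameter values are illustrative assumptions, not the paper's exact experimental setup.

```python
import numpy as np
from scipy.integrate import solve_ivp

def coupled_lorenz(t, s, sigma=10.0, rho=28.0, beta=8.0 / 3.0, c=1.0):
    """Two Lorenz systems; the second is driven by the first through a
    diffusive coupling term c * (x1 - x2) in its x equation."""
    x1, y1, z1, x2, y2, z2 = s
    return [
        sigma * (y1 - x1),                  # driver subsystem
        x1 * (rho - z1) - y1,
        x1 * y1 - beta * z1,
        sigma * (y2 - x2) + c * (x1 - x2),  # response subsystem
        x2 * (rho - z2) - y2,
        x2 * y2 - beta * z2,
    ]

t_eval = np.arange(0.0, 100.0, 0.01)
sol = solve_ivp(coupled_lorenz, (0.0, 100.0), np.random.randn(6),
                t_eval=t_eval, rtol=1e-9, atol=1e-9)
observations = sol.y[[0, 3]].T  # observe one coordinate per subsystem
```

Passing `observations` to a transfer entropy scorer such as `max_te_parents` above should, for sufficiently strong coupling c, identify the first subsystem as the driver of the second.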
