Abstract

An information-theoretic approach to numerically determine the Markov order of discrete stochastic processes defined over a finite state space is introduced. To measure statistical dependencies between different time points of symbolic time series, two information-theoretic measures are proposed. The first measure is time-lagged mutual information between the random variables Xn and Xn+k, representing the values of the process at time points n and n + k, respectively. The measure will be termed autoinformation, in analogy to the autocorrelation function for metric time series, but using Shannon entropy rather than linear correlation. This measure is complemented by the conditional mutual information between Xn and Xn+k, removing the influence of the intermediate values Xn+k−1, …, Xn+1. The second measure is termed partial autoinformation, in analogy to the partial autocorrelation function (PACF) in metric time series analysis. Mathematical relations with known quantities such as the entropy rate and active information storage are established. Both measures are applied to a number of examples, ranging from theoretical Markov and non-Markov processes with known stochastic properties, to models from statistical physics, and finally, to a discrete transform of an EEG data set. The combination of autoinformation and partial autoinformation yields important insights into the temporal structure of the data in all test cases. For first- and higher-order Markov processes, partial autoinformation correctly identifies the order parameter, but also suggests extended, non-Markovian effects in the examples that lack the Markov property. For three hidden Markov models (HMMs), the underlying Markov order is found. The combination of both quantities may be used as an early step in the analysis of experimental, non-metric time series and can be employed to discover higher-order Markov dependencies, non-Markovianity and periodicities in symbolic time series.
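
To make the two measures concrete, the following is a minimal plug-in (frequency-based) sketch of autoinformation A(k) = I(X_n; X_{n+k}) and partial autoinformation π(k) = I(X_{n+k}; X_n | X_{n+1}, …, X_{n+k−1}) for a symbolic sequence. The function names and the toy Markov chain are illustrative choices, not the implementation used in the study, and plug-in block entropies are only reliable for small alphabets and short lags.

```python
import numpy as np
from collections import Counter

def entropy(samples):
    """Plug-in Shannon entropy (in bits) of a list of tuples."""
    counts = np.array(list(Counter(samples).values()), dtype=float)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

def blocks(x, lags):
    """All tuples (x[n + l] for l in lags) over valid start positions n."""
    m = max(lags)
    return [tuple(x[n + l] for l in lags) for n in range(len(x) - m)]

def autoinformation(x, k):
    """A(k) = I(X_n; X_{n+k}), estimated from empirical frequencies."""
    return (entropy(blocks(x, [0])) + entropy(blocks(x, [k]))
            - entropy(blocks(x, [0, k])))

def partial_autoinformation(x, k):
    """pi(k) = I(X_{n+k}; X_n | X_{n+1}, ..., X_{n+k-1}),
    written as a combination of four block entropies."""
    if k == 1:
        return autoinformation(x, 1)
    mid = list(range(1, k))  # lags of the intermediate values
    return (entropy(blocks(x, mid + [k])) + entropy(blocks(x, [0] + mid))
            - entropy(blocks(x, [0] + mid + [k])) - entropy(blocks(x, mid)))

# Toy example: a binary first-order Markov chain with strong persistence.
# Partial autoinformation should be clearly positive at lag 1 and close to
# zero for all larger lags, while autoinformation decays slowly with k.
rng = np.random.default_rng(0)
x = [0]
for _ in range(20000):
    x.append(x[-1] if rng.random() < 0.9 else 1 - x[-1])

for k in range(1, 6):
    print(f"k={k}  A(k)={autoinformation(x, k):.4f}  "
          f"pi(k)={partial_autoinformation(x, k):.4f}")
```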

Highlights

  • Information theory occupies a central role in time series analysis

  • Active information storage can be expressed as the difference between a Shannon entropy and the entropy rate: $a_X(n+k-1, k) = I(X_{n+k}; X^{(k)}_{n+k-1}) = H(X_{n+k}) - H(X_{n+k} \mid X^{(k)}_{n+k-1}) = H(X_{n+k}) - h_X(n+k-1, k)$, where $X^{(k)}_{n+k-1} = (X_n, \ldots, X_{n+k-1})$ denotes the block of the $k$ preceding values (a numerical sketch of this identity follows the highlights)

  • While autoinformation measures the statistical dependence between Xn and Xn+k directly, partial autoinformation removes the influence of the segment between both time points

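The highlighted identity states that active information storage equals the entropy of X_{n+k} minus the finite-history entropy rate h_X(n+k−1, k) = H(X_{n+k} | X^{(k)}_{n+k−1}). The sketch below illustrates this relation with naive block-entropy estimates under an assumption of stationarity; the function names and the coin-flip example are illustrative and not taken from the paper.

```python
import numpy as np
from collections import Counter

def block_entropy(x, L):
    """Plug-in Shannon entropy (bits) of length-L blocks of the symbol sequence x."""
    counts = Counter(tuple(x[n:n + L]) for n in range(len(x) - L + 1))
    p = np.array(list(counts.values()), dtype=float)
    p /= p.sum()
    return -np.sum(p * np.log2(p))

def entropy_rate(x, k):
    """Finite-history entropy rate h_X(k) = H(X_{n+k} | X_n, ..., X_{n+k-1})."""
    return block_entropy(x, k + 1) - block_entropy(x, k)

def active_information_storage(x, k):
    """a_X(k) = H(X_{n+k}) - h_X(k): marginal entropy minus entropy rate."""
    return block_entropy(x, 1) - entropy_rate(x, k)

# Example: i.i.d. fair coin flips, a memoryless process.
rng = np.random.default_rng(1)
coin = rng.integers(0, 2, size=50000).tolist()
print("entropy rate  :", entropy_rate(coin, 3))
print("active storage:", active_information_storage(coin, 3))
```

For the i.i.d. fair coin, the estimated entropy rate is close to 1 bit per symbol and the active information storage is close to zero, as expected for a memoryless process.
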
Introduction and Background

Information theory occupies a central role in time series analysis. The concept of entropy provides numerous important connections to statistical physics and thermodynamics, often useful in the interpretation of the results (Kullback, 1959; Cover and Thomas, 2006). For symbolic time series, collections of theory and methods are readily available (Daw et al., 2003; Mézard and Montanari, 2009). For continuous-valued, discrete-time stochastic processes, the Box-Jenkins approach provides a standardized analysis procedure (Box and Jenkins, 1976). The procedure addresses the impressive complexity of possible stochastic processes by combining semi-quantitative, visual analysis steps with a number of rigorous statistical test procedures. The first step in Box-Jenkins analysis is the visual and statistical assessment of the autocorrelation function (ACF) and the partial autocorrelation function (PACF) of the data. The order of purely autoregressive processes can be directly deduced from the PACF coefficients: for a p-th order autoregressive process, it can be shown that the PACF coefficients for time lags larger than p are equal to zero, within statistical limits.
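
As an illustration of this cutoff property (not part of the original text), the following sketch simulates an AR(2) process with arbitrarily chosen coefficients and estimates its PACF using statsmodels; coefficients beyond lag 2 should fall within the approximate 95% confidence band around zero.

```python
import numpy as np
from statsmodels.tsa.stattools import pacf

# Simulate a stationary AR(2) process: x_t = 0.5 x_{t-1} - 0.3 x_{t-2} + eps_t
rng = np.random.default_rng(42)
n = 5000
x = np.zeros(n)
for t in range(2, n):
    x[t] = 0.5 * x[t - 1] - 0.3 * x[t - 2] + rng.standard_normal()

coeffs = pacf(x, nlags=10)          # coeffs[0] is the trivial lag-0 value of 1
for lag, c in enumerate(coeffs[1:], start=1):
    # Lags 1 and 2 are clearly nonzero; lags > 2 should stay within the
    # approximate 95% band +/- 1.96/sqrt(n) expected under the cutoff.
    print(f"lag {lag:2d}: PACF = {c:+.3f}")
print("95% band:", 1.96 / np.sqrt(n))
```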

