A probabilistic algorithm for aggregating vastly undersampled large Markov chains

Andreas Bittracher,Christof Schütte

doi:10.1016/j.physd.2020.132799

Andreas Bittracher, Christof Schütte

Open Access

https://doi.org/10.1016/j.physd.2020.132799

Copy DOI

Abstract

Model reduction of large Markov chains is an essential step in a wide array of techniques for understanding complex systems and for efficiently learning structures from high-dimensional data. We present a novel aggregation algorithm for compressing such chains that exploits a specific low-rank structure in the transition matrix which, e.g., is present in metastable systems, among others. It enables the recovery of the aggregates from a vastly undersampled transition matrix which in practical applications may gain a speedup of several orders of magnitude over methods that require the full transition matrix. Moreover, we show that the new technique is robust under perturbation of the transition matrix. The practical applicability of the new method is demonstrated by identifying a reduced model for the large-scale traffic flow patterns from real-world taxi trip data.

Highlights

Large-scale time- and space-discrete Markov chains are ubiquitous in many areas of quantitative science, where they arise as discretizations of continuous models [1,2,3,4], as formalization of network-based models [5,6,7], or as models of many other types of complex dynamics
Markov chains arising from discretized biomolecular systems often exhibit metastability, the phenomenon that on long time scales, the dynamics is determined by rare jumps between almost-invariant subsets of states [1,8,9]
We have demonstrated that in applications where the computation of the transition matrix is the computational bottleneck, this can lead to a speedup of factor 10 or more over conventional model reduction algorithms

Summary

Introduction

Large-scale time- and space-discrete Markov chains are ubiquitous in many areas of quantitative science, where they arise as discretizations of continuous models [1,2,3,4], as formalization of network-based models [5,6,7], or as models of many other types of complex dynamics. Markov chains arising from discretized biomolecular systems often exhibit metastability, the phenomenon that on long time scales, the dynamics is determined by rare jumps between almost-invariant subsets of states [1,8,9] Another example are complex traffic networks, whose transition matrices often exhibit a low-rank structure, which can be explained by patterns in the large-scale traffic flow between neighborhoods of a city [10,11]. Due to its probabilistic nature, the algorithm is able to exploit the low-rank structure of the full transition matrix without detailed knowledge of it This gives our method a computational advantage of several orders of magnitude over methods that require the full transition matrix. The entry of the ith row and jth column of A is denoted by Aij

Aggregatable Markov chains

Lumpability and deflatability

Almost aggregability

A probabilistic aggregation algorithm

Sparse recovery of the aggregates

Probabilistic column sampling

Applicability to almost aggregatable Markov chains

Recovery of the reduced transition matrix

Probabilistic matrix recovery

A generic almost aggregatable process

A discretized metastable Langevin process

Aggregation of Manhattan taxi trips

Conclusions

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Physica D: Nonlinear Phenomena	Publication Date: Nov 12, 2020
Citations: 5	License type: cc-by

R Discovery Prime

R Discovery Prime

A probabilistic algorithm for aggregating vastly undersampled large Markov chains

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Physica D: Nonlinear Phenomena

Lead the way for us

Similar Papers

Performance gradient estimation for the very large finite Markov chains
B Zhang ... Y.-C Ho
IEEE Transactions on Automatic Control | VOL. 36
B Zhang, et. al.B Zhang ... Y.-C Ho
01 Jan 1991
IEEE Transactions on Automatic Control | VOL. 36

On the efficient sequential and distributed generation of very large Markov chains from stochastic Petri nets
B Haverkort ... A Bell
-
B Haverkort, et. al.B Haverkort ... A Bell
08 Sep 1999
08 Sep 1999

A New Combinatorial Algorithm for Large Markov Chains (Extended Abstract)
Anna Gambin ... Piotr Pokarowski
-
Anna Gambin, et. al.Anna Gambin ... Piotr Pokarowski
01 Jan 2001
01 Jan 2001

Predictability of large-scale atmospheric flow patterns connected to extreme precipitation events in the Mediterranean
Nikolaos Mastrantonas ... Florian Pappenberger
-
Nikolaos Mastrantonas, et. al.Nikolaos Mastrantonas ... Florian Pappenberger
04 Mar 2021
04 Mar 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A probabilistic algorithm for aggregating vastly undersampled large Markov chains

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Physica D: Nonlinear Phenomena