Abstract

<p>Scientific inquiry seeks to understand natural phenomena by understanding their underlying processes, i.e., by identifying cause and effect. In addition to mere scientific curiosity, an understanding of cause and effect relationships is necessary to predict the effect of changing dynamical regimes and for the attribution of extreme events to potential causes. It is thus an important question to ask how, in cases where controlled experiments are not feasible, causation can still be inferred from the statistical dependencies in observed time series.</p><p>A central obstacle for such an inference is the potential existence of unobserved causally relevant variables. Arguably, this is more likely to be the case than not, for example unmeasured deep oceanic variables in atmospheric processes. Unobserved variables can act as confounders (meaning they are a common cause of two or more observed variables) and thus introduce spurious, i.e., non-causal dependencies. Despite these complications, the last three decades have seen the development of so-called causal discovery algorithms (an example being FCI by Spirtes et al., 1999) that are often able to identify spurious associations and to distinguish them from genuine causation. This opens the possibility for a data-driven approach to infer cause and effect relationships among climate variables, thereby contributing to a better understanding of Earth's complex climate system.</p><p>These methods are, however, not yet well adapted to some specific challenges that climate time series often come with, e.g. strong autocorrelation, time lags and nonlinearities. To close this methodological gap, we generalize the ideas of the recent PCMCI causal discovery algorithm (Runge et al., 2019) to time series where unobserved causally relevant variables may exist (in contrast, PCMCI made the assumption of no confounding). Further, we present preliminary applications to modes of climate variability.</p>

Highlights

  • Causal Discovery for Climate Time Series in the Presence of Unobserved Variables Andreas Gerhardus and Jakob Runge German Aerospace Center, Institute of Data Science, Jena

  • Some of the unobserved variables may be causally relevant: Z is causally relevant if it is a cause of two observed variables X and Y 1 If Z is unobserved, it is called a hidden confounder or a hidden common cause

  • In previous work we introduced the PCMCI-Algorithm, a modification of the PC-Algorithm to better handle autocorrelated time series 1

Read more

Summary

Andreas Gerhardus and Jakob Runge

Scientific inquiry seeks to understand natural phenomena by understanding their underlying processes, i.e., by identifying cause and effect. Unobserved variables can act as confounders (meaning they are a common cause of two or more observed variables) and introduce spurious, i.e., non-causal dependencies Despite these complications, the last three decades have seen the development of so-called causal discovery algorithms (an example being FCI by Spirtes et al, 1999) that are often able to identify spurious associations and to distinguish them from genuine causation. The last three decades have seen the development of so-called causal discovery algorithms (an example being FCI by Spirtes et al, 1999) that are often able to identify spurious associations and to distinguish them from genuine causation This opens the possibility for a data-driven approach to infer cause and effect relationships among climate variables, thereby contributing to a better understanding of Earth's complex climate system.

Causal Relationships and Their Inference by Experimentation
Inference of Causal Relationships from Observational Data?
Unobserved Causally Relevant Variables
Causal Discovery With and Without Causal Sufficiency
Example of Discovering Causal Graphs with PC and FCI
Challenges for Causal Discovery in Climate Time Series
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call