Compatible priors for model selection of high-dimensional Gaussian DAGs

Stefano Peluso,Guido Consonni

doi:10.1214/20-ejs1768

Abstract

Graphical models represent a powerful framework to incorporate conditional independence structure for the statistical analysis of high-dimensional data. In this paper we focus on Directed Acyclic Graphs (DAGs). In the Gaussian setting, a prior recently introduced for the parameters associated to the (modified) Cholesky decomposition of the precision matrix is the DAG-Wishart. The flexibility introduced through a rich choice of shape hyperparameters coupled with conjugacy are two desirable assets of this prior which are especially welcome for estimation and prediction. In this paper we look at the DAG-Wishart prior from the perspective of model selection, with special reference to its consistency properties in high dimensional settings. We show that Bayes factor consistency only holds when comparing two DAGs which do not belong to the same Markov equivalence class, equivalently they encode distinct conditional independencies; a similar result holds for posterior ratio consistency. We also prove that DAG-Wishart distributions with arbitrarily chosen hyperparameters will lead to incompatible priors for model selection, because they assign different marginal likelihoods to Markov equivalent graphs. To overcome this difficulty, we propose a constructive method to specify DAG-Wishart priors whose suitably constrained shape hyperparameters ensure compatibility for DAG model selection.

Highlights

The analysis of multivariate data often aims at understanding the dependence structures among variables, as in networks of protein-protein interactions
We show that Bayes factor consistency only holds when comparing two Directed Acyclic Graphs (DAGs) which do not belong to the same Markov equivalence class, equivalently they encode distinct conditional independencies; a similar result holds for posterior ratio consistency
In the Bayesian framework this is cast as a model selection problem: a prior distribution is assigned to the parameter space of each DAG producing a marginal likelihood which, coupled with a prior distribution on the space of all DAGs, leads to the posterior distribution on model space

Summary

Introduction

The analysis of multivariate data often aims at understanding the dependence structures among variables, as in networks of protein-protein interactions. If no causal interpretation is attributed to DAGs (Lauritzen, 2001; Dawid, 2003), all DAGs in the same Markov equivalence class are indistinguishable based on obervational data, and should have the same marginal likelihood This represents a basic compatibility requirement for priors for DAG model selection, and is satisfied by the method of Geiger and Heckerman (2002). We highlight that DAG-Wishart priors without suitably specified hyperparameters will fail to assign the same marginal likelihood to Markov equivalent DAGs, and are not compatible for DAG model selection. This shortcoming is overcome in our third contribution where we present a constructive method to obtain compatible DAG-Wishart priors for model selection of high-dimensional DAGs, and show how this severely constrains the shape hyperparameters of the DAG-Wishart.

DAG-Wishart prior and equivalence classes

A simple motivating example

Posterior ratio consistency for equivalent graphs

Compatible DAG-Wishart prior

Simulation studies

Discussion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Electronic Journal of Statistics	Publication Date: Jan 1, 2020
Citations: 9	License type: cc-by

R Discovery Prime

R Discovery Prime

Compatible priors for model selection of high-dimensional Gaussian DAGs

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Electronic Journal of Statistics

Lead the way for us

Similar Papers

Counting and Sampling from Markov Equivalent DAGs Using Clique Trees
Amiremad Ghassami ... Kun Zhang
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 33
Amiremad Ghassami, et. al.Amiremad Ghassami ... Kun Zhang
17 Jul 2019
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 33

A characterization of Markov equivalence classes for acyclic digraphs
Steen A Andersson ... David Madigan
The Annals of Statistics | VOL. 25
Steen A Andersson, et. al.Steen A Andersson ... David Madigan
01 Apr 1997
The Annals of Statistics | VOL. 25

Characterizing Markov equivalence classes for AMP chain graph models
Steen A Andersson ... Michael D Perlman
The Annals of Statistics | VOL. 34
Steen A Andersson, et. al.Steen A Andersson ... Michael D Perlman
01 Apr 2006
The Annals of Statistics | VOL. 34

Learning Markov Equivalence Classes of Directed Acyclic Graphs: An Objective Bayes Approach
Federico Castelletti ... Guido Consonni
Bayesian Analysis | VOL. 13
Federico Castelletti, et. al.Federico Castelletti ... Guido Consonni
01 Dec 2018
Bayesian Analysis | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Compatible priors for model selection of high-dimensional Gaussian DAGs

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Electronic Journal of Statistics