Abstract

Graphical models represent a powerful framework to incorporate conditional independence structure for the statistical analysis of high-dimensional data. In this paper we focus on Directed Acyclic Graphs (DAGs). In the Gaussian setting, a prior recently introduced for the parameters associated to the (modified) Cholesky decomposition of the precision matrix is the DAG-Wishart. The flexibility introduced through a rich choice of shape hyperparameters coupled with conjugacy are two desirable assets of this prior which are especially welcome for estimation and prediction. In this paper we look at the DAG-Wishart prior from the perspective of model selection, with special reference to its consistency properties in high dimensional settings. We show that Bayes factor consistency only holds when comparing two DAGs which do not belong to the same Markov equivalence class, equivalently they encode distinct conditional independencies; a similar result holds for posterior ratio consistency. We also prove that DAG-Wishart distributions with arbitrarily chosen hyperparameters will lead to incompatible priors for model selection, because they assign different marginal likelihoods to Markov equivalent graphs. To overcome this difficulty, we propose a constructive method to specify DAG-Wishart priors whose suitably constrained shape hyperparameters ensure compatibility for DAG model selection.

Highlights

  • The analysis of multivariate data often aims at understanding the dependence structures among variables, as in networks of protein-protein interactions

  • We show that Bayes factor consistency only holds when comparing two Directed Acyclic Graphs (DAGs) which do not belong to the same Markov equivalence class, equivalently they encode distinct conditional independencies; a similar result holds for posterior ratio consistency

  • In the Bayesian framework this is cast as a model selection problem: a prior distribution is assigned to the parameter space of each DAG producing a marginal likelihood which, coupled with a prior distribution on the space of all DAGs, leads to the posterior distribution on model space

Read more

Summary

Introduction

The analysis of multivariate data often aims at understanding the dependence structures among variables, as in networks of protein-protein interactions. If no causal interpretation is attributed to DAGs (Lauritzen, 2001; Dawid, 2003), all DAGs in the same Markov equivalence class are indistinguishable based on obervational data, and should have the same marginal likelihood This represents a basic compatibility requirement for priors for DAG model selection, and is satisfied by the method of Geiger and Heckerman (2002). We highlight that DAG-Wishart priors without suitably specified hyperparameters will fail to assign the same marginal likelihood to Markov equivalent DAGs, and are not compatible for DAG model selection. This shortcoming is overcome in our third contribution where we present a constructive method to obtain compatible DAG-Wishart priors for model selection of high-dimensional DAGs, and show how this severely constrains the shape hyperparameters of the DAG-Wishart.

DAG-Wishart prior and equivalence classes
A simple motivating example
Posterior ratio consistency for equivalent graphs
Compatible DAG-Wishart prior
Simulation studies
Discussion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.