Efficient learning of discrete graphical models* *This article is an updated version of: Vuffray M, Misra S and Lokhov A 2020 Efficient Learning of Discrete Graphical Models Advances in Neural Information Processing Systems vol 33 eds H Larochelle, M Ranzato, R Hadsell, M F Balcan and H Lin (New York: Curran Associates) pp 13575–85.

Marc Vuffray,Sidhant Misra,Andrey Y Lokhov

doi:10.1088/1742-5468/ac3aea

Abstract

Graphical models are useful tools for describing structured high-dimensional probability distributions. Development of efficient algorithms for learning graphical models with least amount of data remains an active research topic. Reconstruction of graphical models that describe the statistics of discrete variables is a particularly challenging problem, for which the maximum likelihood approach is intractable. In this work, we provide the first sample-efficient method based on the interaction screening framework that allows one to provably learn fully general discrete factor models with node-specific discrete alphabets and multi-body interactions, specified in an arbitrary basis. We identify a single condition related to model parametrization that leads to rigorous guarantees on the recovery of model structure and parameters in any error norm, and is readily verifiable for a large class of models. Importantly, our bounds make explicit distinction between parameters that are proper to the model and priors used as an input to the algorithm. Finally, we show that the interaction screening framework includes all models previously considered in the literature as special cases, and for which our analysis shows a systematic improvement in sample complexity.

Highlights

Representing and understanding the structure of direct correlations between distinct random variables with graphical models is a fundamental task that is essential to scientific and engineering endeavors
Even though it has been later shown in [13] that regularized pseudo-likelihood supplemented with a crucial post-processing step leads to a structure estimator for pairwise binary models, strong numerical and theoretical evidence provided in that work demonstrated that RISE is superior in terms of worst-case sample complexity
We propose a generalization of the estimator RISE, first introduced in [18] for pairwise binary graphical models, in order to reconstruct general discrete graphical models defined in (1)

Summary

Introduction

Representing and understanding the structure of direct correlations between distinct random variables with graphical models is a fundamental task that is essential to scientific and engineering endeavors. The algorithm RISE suggested in this work is based on the minimization of a novel local convex loss function, called the Interaction Screening objective, supplemented with an l1 penalty to promote sparsity Even though it has been later shown in [13] that regularized pseudo-likelihood supplemented with a crucial post-processing step leads to a structure estimator for pairwise binary models, strong numerical and theoretical evidence provided in that work demonstrated that RISE is superior in terms of worst-case sample complexity. The so-called SPARSITRON algorithm in [12] has the flavor of a stochastic first order method with multiplicative updates It has a low computational complexity and is sample-efficient for structure recovery of two subclasses of discrete graphical models: multiwise graphical models over binary variables or pairwise models with general alphabets. We provide a fully parallelizable algorithmic formulation for the GRISE estimator and SUPRISE algorithm, and show that they have efficient run times of O(pL) for a model of size p with L-order interactions, that includes the best-known O(p2) scaling for pairwise models

Parameterized family of models

Model selection problem

Sufficient conditions for well-posedness

Generalized regularized interaction screening estimator

Error bound on parameter estimation with GRISE

Structure identification and parameter estimation

The structure of graphical models

From local learnability condition to nonsingular parametrization of cliques

Construct the induced sub-graph

Conclusion and future work

A About well-posedness and local learnability conditions

B Proofs of GRISE estimation error bound

Gradient concentration

Restricted strong convexity

C Efficient implementation of GRISE and its computational complexity

Dimension independence and easier computation of NPC constants

Proofs of estimation guarantees for the SUPRISE algorithm

Binary models on the monomial basis

Findings

L-wise models with arbitrary alphabets on the indicator basis

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Statistical Mechanics: Theory and Experiment

Lead the way for us

Journal: Journal of Statistical Mechanics: Theory and Experiment	Publication Date: Dec 1, 2021
License type: cc-by

Similar Papers

Quantum circuits for discrete graphical models
Nico Piatkowski ... Christa Zoufal
Quantum Machine Intelligence | VOL. 6
Nico Piatkowski, et. al.Nico Piatkowski ... Christa Zoufal
28 Jun 2024
Quantum Machine Intelligence | VOL. 6

Applications of Graphical Models in Quantitative Genetics and Genomics
Guilherme J M Rosa ... Vivian P S Felipe
-
Guilherme J M Rosa, et. al.Guilherme J M Rosa ... Vivian P S Felipe
01 Jan 2015
01 Jan 2015

Beyond Gaussian : copula graphical models for non-Gaussian data
Hang Yu
-
Hang YuHang Yu
01 Jan 2015
01 Jan 2015

Statistical Methods to Study Timing of Vulnerability with Sparsely Sampled Data on Environmental Toxicants
Brisa Ney Sánchez ... Howard Hu
Environmental Health Perspectives | VOL. 119
Brisa Ney Sánchez, et. al.Brisa Ney Sánchez ... Howard Hu
08 Dec 2010
Environmental Health Perspectives | VOL. 119

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Statistical Mechanics: Theory and Experiment