Core Dependency Networks

Alejandro Molina, Alexander Munteanu, Kristian Kersting

doi:10.1609/aaai.v32i1.11726

Abstract

Many applications infer the structure of a probabilistic graphical model from data to elucidate the relationships between variables. But how can we train graphical models on a massive data set? In this paper, we show how to construct coresets (compressed data sets that can be used as a proxy for the original data and have provably bounded worst-case error) for Gaussian dependency networks (DNs), i.e., cyclic directed graphical models over Gaussians, where the parents of each variable are its Markov blanket. Specifically, we prove that Gaussian DNs admit coresets of size independent of the size of the data set. Unfortunately, this does not extend to DNs over members of the exponential family in general. As we will prove, Poisson DNs do not admit small coresets. Despite this worst-case result, we will provide an argument why our coreset construction for DNs can still work well in practice on count data. To corroborate our theoretical results, we empirically evaluated the resulting Core DNs on real data sets. The results demonstrate significant gains over no or naive sub-sampling, even in the case of count data.
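To make the coreset idea concrete, here is a minimal sketch under stated assumptions; it is not taken from the paper. It assumes that each node of a Gaussian DN is fit by least-squares regression on all other variables, and it uses leverage-score sampling as one standard way to build a small weighted subset for least-squares problems. The abstract does not specify the construction, so the sampling scheme, the helper names (`solve`, `gram`, `leverage_scores`, `sample_coreset`, `fit_node`), the synthetic data, and the sizes (2000 rows, 200 sampled rows) are all illustrative choices.

```python
import random

def solve(A, b):
    """Solve the small linear system A x = b by Gauss-Jordan elimination."""
    n = len(A)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))  # partial pivoting
        M[c], M[p] = M[p], M[c]
        for r in range(n):
            if r != c:
                f = M[r][c] / M[c][c]
                M[r] = [a - f * v for a, v in zip(M[r], M[c])]
    return [M[i][n] / M[i][i] for i in range(n)]

def gram(rows, weights):
    """Weighted Gram matrix: sum_i w_i * x_i x_i^T."""
    d = len(rows[0])
    return [[sum(w * x[j] * x[k] for x, w in zip(rows, weights))
             for k in range(d)] for j in range(d)]

def leverage_scores(rows):
    """Leverage score of each row x_i: x_i^T (X^T X)^{-1} x_i."""
    G = gram(rows, [1.0] * len(rows))
    return [sum(a * b for a, b in zip(x, solve(G, x))) for x in rows]

def sample_coreset(rows, m, rng):
    """Draw m rows with probability p_i proportional to leverage,
    with replacement, and reweight each drawn row by 1 / (m * p_i)."""
    scores = leverage_scores(rows)
    total = sum(scores)
    probs = [s / total for s in scores]
    idx = rng.choices(range(len(rows)), weights=probs, k=m)
    return [rows[i] for i in idx], [1.0 / (m * probs[i]) for i in idx]

def fit_node(rows, weights, target):
    """Weighted least squares of column `target` on all other columns
    (no intercept; the synthetic data below is roughly zero-mean)."""
    X = [[v for j, v in enumerate(r) if j != target] for r in rows]
    y = [r[target] for r in rows]
    G = gram(X, weights)
    rhs = [sum(w * x[j] * t for x, t, w in zip(X, y, weights))
           for j in range(len(X[0]))]
    return solve(G, rhs)

# Synthetic data (assumption): three variables, the third depends on the first two.
rng = random.Random(0)
data = []
for _ in range(2000):
    a, b = rng.gauss(0, 1), rng.gauss(0, 1)
    data.append([a, b, 2 * a - b + rng.gauss(0, 0.1)])

# One weighted sample of the full data matrix serves every node's regression.
core_rows, core_weights = sample_coreset(data, 200, rng)
full = [fit_node(data, [1.0] * len(data), j) for j in range(3)]
core = [fit_node(core_rows, core_weights, j) for j in range(3)]
```

Comparing `full[j]` with `core[j]` for each node `j` shows how closely the regressions on the 200 weighted rows track those on all 2000 rows. The sample is drawn once from the whole data matrix, so the same weighted rows are reused for every node.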


Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence
Publication Date: Apr 29, 2018
Citations: 9

