Regular Decomposition of Large Graphs: Foundation of a Sampling Approach to Stochastic Block Model Fitting

Hannu Reittu,Marianna Bolla,Ilkka Norros,Fülöp Bazsó,Tomi Räty

doi:10.1007/s41019-019-0084-x

Abstract

We analyze the performance of regular decomposition, a method for compression of large and dense graphs. This method is inspired by Szemerédi’s regularity lemma (SRL), a generic structural result of large and dense graphs. In our method, stochastic block model (SBM) is used as a model in maximum likelihood fitting to find a regular structure similar to the one predicted by SRL. Another ingredient of our method is Rissanen’s minimum description length principle (MDL). We consider scaling of algorithms to extremely large size of graphs by sampling a small subgraph. We continue our previous work on the subject by proving some experimentally found claims. Our theoretical setting does not assume that the graph is generated from a SBM. The task is to find a SBM that is optimal for modeling the given graph in the sense of MDL. This assumption matches with real-life situations when no random generative model is appropriate. Our aim is to show that regular decomposition is a viable and robust method for large graphs emerging, say, in Big Data area.

Highlights

In the conference paper [1] we conjectured the possibility of applying our regular decomposition algorithm [2] to very large graphs, for which the full adjacency information is not possible to process, using a sampling approach
Our future work will be dedicated to the case of sparse graphs, which is the most important in Big Data
Testable graph parameters are nonparametric statistics that can be consistently estimated by appropriate sampling, introduced by László Lovász and coauthors, see [17]

Summary

Introduction

In the conference paper [1] we conjectured the possibility of applying our regular decomposition algorithm [2] to very large graphs, for which the full adjacency information is not possible to process, using a sampling approach. We prove claims of the preceding paper and give precise conditions under which they are true. This method allows to abandon the customary assumption that the graph be generated by a SBM. Revealing and understanding various relations embedded in such large data sets is of special interest. In mathematical terms, such relations form a huge graph. Our method suggests a way to overcome such hurdles in the case of dense data

Objectives

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Data Science and Engineering	Publication Date: Mar 1, 2019
Citations: 11	License type: open-access

R Discovery Prime

R Discovery Prime

Regular Decomposition of Large Graphs: Foundation of a Sampling Approach to Stochastic Block Model Fitting

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Data Science and Engineering

Lead the way for us

Similar Papers

Analysis of large sparse graphs using regular decomposition of graph distance matrices
Hannu Reittu ... Lasse Leskela
-
Hannu Reittu, et. al.Hannu Reittu ... Lasse Leskela
01 Dec 2018
01 Dec 2018

Exact Recovery and Sharp Thresholds of Stochastic Ising Block Model
Min Ye
IEEE Transactions on Information Theory | VOL. 67
Min YeMin Ye
01 Dec 2021
IEEE Transactions on Information Theory | VOL. 67

Regular decomposition of large graphs and other structures: Scalability and robustness towards missing data
Hannu Reittu ... Ilkka Norros
-
Hannu Reittu, et. al.Hannu Reittu ... Ilkka Norros
01 Dec 2017
01 Dec 2017

A Semi-exact Algorithm for Quickly Computing A Maximum Weight Clique in Large Sparse Graphs
Shaowei Cai ... Yiyuan Wang
Journal of Artificial Intelligence Research | VOL. 72
Shaowei Cai, et. al.Shaowei Cai ... Yiyuan Wang
14 Sep 2021
Journal of Artificial Intelligence Research | VOL. 72

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Regular Decomposition of Large Graphs: Foundation of a Sampling Approach to Stochastic Block Model Fitting

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Data Science and Engineering