Graph summarization with quality guarantees

Matteo Riondato, Francesco Bonchi, David García-Soriano

doi:10.1007/s10618-016-0468-8

Abstract

We study the problem of graph summarization. Given a large graph, we aim to produce a concise lossy representation (a summary) that can be stored in main memory and used to approximately answer queries about the original graph much faster than by using the exact representation. In this work we study a very natural type of summary: the original set of vertices is partitioned into a small number of supernodes connected by superedges to form a complete weighted graph. The superedge weights are the edge densities between vertices in the corresponding supernodes. To quantify the dissimilarity between the original graph and a summary, we adopt the reconstruction error and the cut-norm error. By exposing a connection between graph summarization and geometric clustering problems (i.e., k-means and k-median), we develop the first polynomial-time approximation algorithms to compute the best possible summary of a given size under both measures. We discuss how to use our summaries to store a (lossy or lossless) compressed graph representation and to approximately answer a large class of queries about the original graph, including adjacency, degree, eigenvector centrality, and triangle and subgraph counting. Answering queries from the summary is very efficient because the running time depends on the number of supernodes in the summary rather than on the number of nodes in the original graph.
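
To make the summary described in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It takes a vertex partition as given (the paper's clustering-based algorithms for choosing the partition are not reproduced here), computes the superedge weights as edge densities, and measures an entrywise reconstruction error. The function names, the use of an L1 error, and the convention that a within-supernode density divides by the squared supernode size are all assumptions made for illustration.

```python
from itertools import product


def summarize(adj, partition):
    """Return the k x k matrix of superedge weights (edge densities).

    adj is a symmetric 0/1 adjacency matrix (list of lists); partition is
    a list of k disjoint vertex lists covering all vertices.
    Assumption: density = sum of adjacency entries in the block divided by
    the block area, including for blocks on the diagonal.
    """
    k = len(partition)
    dens = [[0.0] * k for _ in range(k)]
    for i, j in product(range(k), repeat=2):
        total = sum(adj[u][v] for u in partition[i] for v in partition[j])
        dens[i][j] = total / (len(partition[i]) * len(partition[j]))
    return dens


def l1_reconstruction_error(adj, partition, dens):
    """Sum of |A(u, v) - density of the block containing (u, v)|."""
    block = {u: i for i, part in enumerate(partition) for u in part}
    n = len(adj)
    return sum(
        abs(adj[u][v] - dens[block[u]][block[v]])
        for u in range(n)
        for v in range(n)
    )


def approx_degree(u, partition, dens):
    """Estimate the degree of u from the summary alone, in O(k) time."""
    i = next(idx for idx, part in enumerate(partition) if u in part)
    return sum(dens[i][j] * len(partition[j]) for j in range(len(partition)))


# Hypothetical example: the path graph 0 - 1 - 2 - 3, split into two supernodes.
adj = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
partition = [[0, 1], [2, 3]]
dens = summarize(adj, partition)
print(dens)
print(l1_reconstruction_error(adj, partition, dens))
print(approx_degree(1, partition, dens))
```

In this toy example the degree estimate for vertex 1 is the average degree of its supernode, which illustrates why the answer is approximate: the summary keeps only per-block densities, so vertices in the same supernode become indistinguishable.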

Journal: Data Mining and Knowledge Discovery
Publication Date: Jun 6, 2016
Citations: 57
