Abstract

A central problem in graph mining is finding dense subgraphs, with several applications in different fields, a notable example being identifying communities. While a lot of effort has been put in the problem of finding a single dense subgraph, only recently the focus has been shifted to the problem of finding a set of densest subgraphs. An approach introduced to find possible overlapping subgraphs is the Top-k-Overlapping Densest Subgraphs problem. Given an integer k ge 1 and a parameter lambda > 0, the goal of this problem is to find a set of k dense subgraphs that may share some vertices. The objective function to be maximized takes into account the density of the subgraphs, the parameter lambda and the distance between each pair of subgraphs in the solution. The Top-k-Overlapping Densest Subgraphs problem has been shown to admit a frac{1}{10}-factor approximation algorithm. Furthermore, the computational complexity of the problem has been left open. In this paper, we present contributions concerning the approximability and the computational complexity of the problem. For the approximability, we present approximation algorithms that improve the approximation factor to frac{1}{2}, when k is smaller than the number of vertices in the graph, and to frac{2}{3}, when k is a constant. For the computational complexity, we show that the problem is NP-hard even when k=3.

Highlights

  • Complex systems are usually analyzed with graphs

  • The approximation algorithm for Top-k-Overlapping Densest Subgraphs returns the solution of maximum value between a solution obtained by iteratively solving Densest-Distinct-Subgraph and a solution consisting of k singletons

  • We have shown that Top-k-Overlapping Densest Subgraphs is NP-hard when k = 3 and we have given two approximation algorithms of factor

Read more

Summary

Introduction

Complex systems are usually analyzed with graphs. One of the most studied and central task to understand the behaviour of complex system is the identification of communities, that is cohesive subgraphs. Finding a clique of maximum size in a graph G = (V , E) is an NP-hard problem (Karp 1972) and it is even hard to approximate within factor O(|V |1−ε), for each ε > 0 (Zuckerman 2007). A definition of a dense subgraph that leads to a polynomial-time algorithm is that of average-degree density Other approaches related to Top-k-Overlapping Densest Subgraphs include covering or partitioning an input graph in dense subgraphs, like Minimum Clique Partition (Garey and Johnson 1979) or Minimum s-Club Covering (Dondi et al 2019). Top-k-Overlapping Densest Subgraphs when k is less than the number of vertices in the graph

Definitions
Goldberg’s algorithm and extended Goldberg’s algorithm
Approximating Top-k-Overlapping Densest Subgraphs
Approximation for constant k
A polynomial-time algorithm for Densest-Distinct-Subgraph
Approximation when k is not a constant
Description and analysis of phase 1
Result
Description and analysis of phase 2
Complexity of Top-k-Overlapping Densest Subgraphs
Indeed notice that zi
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call