Synthetic Graphs Research Articles

We introduce the general problem of identifying a smallest edge subset of a given graph whose deletion makes the graph community-free. We consider this problem under two community notions that have attracted significant attention: k -truss and k -core. We also introduce a problem variant where the identified subset contains edges incident to a given set of nodes and ensures that these nodes are not contained in any community: k -truss or k -core, in our case. These problems are directly applicable in social networks: The identified edges can be hidden by users or sanitized from the output graph; or in communication networks: the identified edges correspond to vital network connections. We present a series of theoretical and practical results. On the theoretical side, we show through non-trivial reductions that the problems we introduce are NP-hard and, in fact, hard to approximate. For the k -truss-based problems, we also show exact exponential-time algorithms, as well as a non-trivial lower bound on the size of an optimal solution. On the practical side, we develop a series of heuristics that are sped up by efficient data structures that we propose for updating the truss or core decomposition under edge deletions. In addition, we develop an algorithm to compute the lower bound. Extensive experiments on 11 real-world and synthetic graphs show that our heuristics are effective, outperforming natural baselines, and also efficient (up to two orders of magnitude faster than a natural baseline), thanks to our data structures. Furthermore, we present a case study on a co-authorship network and experiments showing that the removal of edges identified by our heuristics does not substantially affect the clustering structure of the input graph. This work extends a KDD 2021 paper, providing new theoretical results as well as introducing core-based problems and algorithms.

Read full abstract

Community structure is a fundamental topological characteristic of optimally organized brain networks. Currently, there is no clear standard or systematic approach for selecting the most appropriate community detection method. Furthermore, the impact of method choice on the accuracy and robustness of estimated communities (and network modularity), as well as method-dependent relationships between network communities and cognitive and other individual measures, are not well understood. This study analyzed large datasets of real brain networks (estimated from resting-state fMRI from = 5251 pre/early adolescents in the adolescent brain cognitive development [ABCD] study), and = 5338 synthetic networks with heterogeneous, data-inspired topologies, with the goal to investigate and compare three classes of community detection methods: (i) modularity maximization-based (Newman and Louvain), (ii) probabilistic (Bayesian inference within the framework of stochastic block modeling (SBM)), and (iii) geometric (based on graph Ricci flow). Extensive comparisons between methods and their individual accuracy (relative to the ground truth in synthetic networks), and reliability (when applied to multiple fMRI runs from the same brains) suggest that the underlying brain network topology plays a critical role in the accuracy, reliability and agreement of community detection methods. Consistent method (dis)similarities, and their correlations with topological properties, were estimated across fMRI runs. Based on synthetic graphs, most methods performed similarly and had comparable high accuracy only in some topological regimes, specifically those corresponding to developed connectomes with at least quasi-optimal community organization. In contrast, in densely and/or weakly connected networks with difficult to detect communities, the methods yielded highly dissimilar results, with Bayesian inference within SBM having significantly higher accuracy compared to all others. Associations between method-specific modularity and demographic, anthropometric, physiological and cognitive parameters showed mostly method invariance but some method dependence as well. Although method sensitivity to different levels of community structure may in part explain method-dependent associations between modularity estimates and parameters of interest, method dependence also highlights potential issues of reliability and reproducibility. These findings suggest that a probabilistic approach, such as Bayesian inference in the framework of SBM, may provide consistently reliable estimates of community structure across network topologies. In addition, to maximize robustness of biological inferences, identified network communities and their cognitive, behavioral and other correlates should be confirmed with multiple reliable detection methods.

Read full abstract

Synthetic Graphs Research Articles

Related Topics

Articles published on Synthetic Graphs

Reliable and Faithful Generative Explainers for Graph Neural Networks

Synthetic graphs for link prediction benchmarking

Bias reduction via cooperative bargaining in synthetic graph dataset generation

Enhancing the Performance of Automated Scoring Model for Kinematic Graph Answers Using Synthetic Graph Images

Making It Tractable to Detect and Correct Errors in Graphs

Estimate Mass Density Value as A Priori Information for Gravity by using Bayesian Markov Chain Monte Carlo (MCMC)

Cross-community affinity: A polarization measure for multi-community networks

Unsupervised Learning for Lateral-Movement-Based Threat Mitigation in Active Directory Attack Graphs

Hardening Active Directory Graphs via Evolutionary Diversity Optimization based Policies

STEP: Sequence of time-aligned edge plots

ADPSCAN: Structural Graph Clustering with Adaptive Density Peak Selection and Noise Re-Clustering

Synthetic lethal connectivity and graph transformer improve synthetic lethality prediction.

MST: Topology-Aware Message Aggregation for Exascale Graph Processing of Traversal-Centric Algorithms

On the effectiveness of hybrid pooling in mixup-based graph learning for language processing

A simple and effective convolutional operator for node classification without features by graph convolutional networks.

On Breaking Truss-based and Core-based Communities

Community detection in the human connectome: Method types, differences and their impact on inference.

FulBM: Fast fully batch maintenance for landmark-based 3-hop cover labeling

Tensor Network Message Passing.

IWO-IGA—A Hybrid Whale Optimization Algorithm Featuring Improved Genetic Characteristics for Mapping Real-Time Applications onto 2D Network on Chip

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Synthetic Graphs Research Articles

Related Topics

Articles published on Synthetic Graphs

Reliable and Faithful Generative Explainers for Graph Neural Networks

Synthetic graphs for link prediction benchmarking

Bias reduction via cooperative bargaining in synthetic graph dataset generation

Enhancing the Performance of Automated Scoring Model for Kinematic Graph Answers Using Synthetic Graph Images

Making It Tractable to Detect and Correct Errors in Graphs

Estimate Mass Density Value as A Priori Information for Gravity by using Bayesian Markov Chain Monte Carlo (MCMC)

Cross-community affinity: A polarization measure for multi-community networks

Unsupervised Learning for Lateral-Movement-Based Threat Mitigation in Active Directory Attack Graphs

Hardening Active Directory Graphs via Evolutionary Diversity Optimization based Policies

STEP: Sequence of time-aligned edge plots

ADPSCAN: Structural Graph Clustering with Adaptive Density Peak Selection and Noise Re-Clustering

Synthetic lethal connectivity and graph transformer improve synthetic lethality prediction.

MST: Topology-Aware Message Aggregation for Exascale Graph Processing of Traversal-Centric Algorithms

On the effectiveness of hybrid pooling in mixup-based graph learning for language processing

A simple and effective convolutional operator for node classification without features by graph convolutional networks.

On Breaking Truss-based and Core-based Communities

Community detection in the human connectome: Method types, differences and their impact on inference.

FulBM: Fast fully batch maintenance for landmark-based 3-hop cover labeling

Tensor Network Message Passing.

IWO-IGA—A Hybrid Whale Optimization Algorithm Featuring Improved Genetic Characteristics for Mapping Real-Time Applications onto 2D Network on Chip