Abstract
Over the years, several theoretical graph generation models have been proposed. Among the most prominent are: Erd\H{o}s-Renyi random graph model, Watts-Strogatz small world model, Albert-Barab\'{a}si preferential attachment model, Price citation model, and many more. Often, researchers working on an empirical graph want to know, which of the theoretical graph generation models is the closest, i.e., which theoretical model would generate a graph the most similar to the given empirical graph. Usually, in order to assess the similarity of two graphs, centrality measure distributions are compared. For a theoretical graph model this commonly means comparing the empirical graph to a realization of the theoretical graph model, where the realization is generated from the given model using arbitrarily set parameters. The similarity between centrality measure distributions can be measured using standard statistical tests, e.g., the Kolmogorov-Smirnov test of distances between cumulative distributions. This approach is both error-prone and leading to incorrect conclusions. In this work we present a general framework for comparing graphs with theoretical models. Our framework is twofold. First, we show how comparing the entropies of centrality measure distributions (degree centrality, betweenness centrality, closeness centrality, eigenvector centrality) can help assign an empirical graph to the most similar theoretical model. Second, we compare graphs with theoretical graph models based on the perceived complexity of graphs, which in turn is computed as the Kolmogorov Complexity (also known as the algorithmic entropy) of the graph's adjacency matrix. As the result, we introduce a robust and efficient method of assigning an empirical graph to the most probable theoretical graph model.
Highlights
Analysis of real-world networks can be greatly simplified by using artificial generative network models
The first experiment examines the stability of mean entropy of centrality measure distributions under varying parameter of the generative network model parameters
The work presented in this paper examines the usefulness of the entropy when applied to various graph characteristics
Summary
Analysis of real-world networks can be greatly simplified by using artificial generative network models. Over the years numerous generative models have been proposed in the scientific literature Most of these models focus on producing networks exhibiting certain properties, such as a particular distribution of vertex degrees, edge betweenness, or local clustering coefficients. Sometimes a model might be proposed to explain an unexpected empirical result, such as shrinking network diameter or densification of edges. Each such generative model is governed by a small set of parameters, but many models are unstable, i.e. they produce significantly different networks for even a minuscule modification of the parameter value
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.