Abstract

Over the years, several theoretical graph generation models have been proposed. Among the most prominent are: the Erdős–Renyi random graph model, Watts–Strogatz small world model, Albert–Barabási preferential attachment model, Price citation model, and many more. Often, researchers working with real-world data are interested in understanding the generative phenomena underlying their empirical graphs. They want to know which of the theoretical graph generation models would most probably generate a particular empirical graph. In other words, they expect some similarity assessment between the empirical graph and graphs artificially created from theoretical graph generation models. Usually, in order to assess the similarity of two graphs, centrality measure distributions are compared. For a theoretical graph model this means comparing the empirical graph to a single realization of a theoretical graph model, where the realization is generated from the given model using an arbitrary set of parameters. The similarity between centrality measure distributions can be measured using standard statistical tests, e.g., the Kolmogorov–Smirnov test of distances between cumulative distributions. However, this approach is both error-prone and leads to incorrect conclusions, as we show in our experiments. Therefore, we propose a new method for graph comparison and type classification by comparing the entropies of centrality measure distributions (degree centrality, betweenness centrality, closeness centrality). We demonstrate that our approach can help assign the empirical graph to the most similar theoretical model using a simple unsupervised learning method.

Highlights

  • Analysis of real-world graphs can be greatly simplified by using artificial generative graph models, which we will refer to as theoretical graph models throughout this paper

  • The first experiment examines the stability of the mean entropy of centrality measure distributions under varying parameters of the theoretical graph models

  • The work presented in this paper examines the usefulness of entropy when applied to various graph characteristics

Read more

Summary

Introduction

Analysis of real-world graphs can be greatly simplified by using artificial generative graph models, which we will refer to as theoretical graph models throughout this paper. Numerous theoretical graph models have been proposed in the scientific literature. Most of these models focus on producing graphs which exhibit certain properties, such as a particular distribution of vertex degrees, edge betweenness, or local clustering coefficients. Sometimes a model might be proposed to explain an unexpected empirical result, such as shrinking graph diameter or the densification of edges.

Objectives
Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.