Abstract

BackgroundThe recent explosion in biological and other real-world network data has created the need for improved tools for large network analyses. In addition to well established global network properties, several new mathematical techniques for analyzing local structural properties of large networks have been developed. Small over-represented subgraphs, called network motifs, have been introduced to identify simple building blocks of complex networks. Small induced subgraphs, called graphlets, have been used to develop "network signatures" that summarize network topologies. Based on these network signatures, two new highly sensitive measures of network local structural similarities were designed: the relative graphlet frequency distance (RGF-distance) and the graphlet degree distribution agreement (GDD-agreement).Finding adequate null-models for biological networks is important in many research domains. Network properties are used to assess the fit of network models to the data. Various network models have been proposed. To date, there does not exist a software tool that measures the above mentioned local network properties. Moreover, none of the existing tools compare real-world networks against a series of network models with respect to these local as well as a multitude of global network properties.ResultsThus, we introduce GraphCrunch, a software tool that finds well-fitting network models by comparing large real-world networks against random graph models according to various network structural similarity measures. It has unique capabilities of finding computationally expensive RGF-distance and GDD-agreement measures. In addition, it computes several standard global network measures and thus supports the largest variety of network measures thus far. Also, it is the first software tool that compares real-world networks against a series of network models and that has built-in parallel computing capabilities allowing for a user specified list of machines on which to perform compute intensive searches for local network properties. Furthermore, GraphCrunch is easily extendible to include additional network measures and models.ConclusionGraphCrunch is a software tool that implements the latest research on biological network models and properties: it compares real-world networks against a series of random graph models with respect to a multitude of local and global network properties. We present GraphCrunch as a comprehensive, parallelizable, and easily extendible software tool for analyzing and modeling large biological networks. The software is open-source and freely available at . It runs under Linux, MacOS, and Windows Cygwin. In addition, it has an easy to use on-line web user interface that is available from the above web page.

Highlights

  • The recent explosion in biological and other real-world network data has created the need for improved tools for large network analyses

  • Results: we introduce GraphCrunch, a software tool that finds well-fitting network models by comparing large real-world networks against random graph models according to various network structural similarity measures

  • GraphCrunch is a software tool that implements the latest research on biological network models and properties: it compares real-world networks against a series of random graph models with respect to a multitude of local and global network properties

Read more

Summary

Introduction

The recent explosion in biological and other real-world network data has created the need for improved tools for large network analyses. Small induced subgraphs, called graphlets, have been used to develop "network signatures" that summarize network topologies Based on these network signatures, two new highly sensitive measures of network local structural similarities were designed: the relative graphlet frequency distance (RGFdistance) and the graphlet degree distribution agreement (GDD-agreement). None of the existing tools compare real-world networks against a series of network models with respect to these local as well as a multitude of global network properties. Many other real-world phenomena have been described in terms of large networks ( called graphs), such as various types of social and technological networks Understanding these complex phenomena has become an important scientific problem that has lead to intensive research in network modeling and analyses. Local properties include network motifs, small over-represented subgraphs [3,4,5], and two measures based on graphlets, small induced subgraphs of large networks: the relative graphlet frequency distance (RGF-distance), which compares the frequencies of the appearance of graphlets in two networks [6], and the graphlet degree distribution agreement (GDD-agreement), which is a graphlet-based generalization of the degree distribution [7] (details are given in Section "Local network properties")

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call