Biological network comparison using graphlet degree distribution

Nataša Pržulj

doi:10.1093/bioinformatics/btl301

Abstract

Analogous to biological sequence comparison, comparing cellular networks is an important problem that could provide insight into biological understanding and therapeutics. For technical reasons, comparing large networks is computationally infeasible, and thus heuristics, such as the degree distribution, clustering coefficient, diameter, and relative graphlet frequency distribution have been sought. It is easy to demonstrate that two networks are different by simply showing a short list of properties in which they differ. It is much harder to show that two networks are similar, as it requires demonstrating their similarity in all of their exponentially many properties. Clearly, it is computationally prohibitive to analyze all network properties, but the larger the number of constraints we impose in determining network similarity, the more likely it is that the networks will truly be similar. We introduce a new systematic measure of a network's local structure that imposes a large number of similarity constraints on networks being compared. In particular, we generalize the degree distribution, which measures the number of nodes 'touching' k edges, into distributions measuring the number of nodes 'touching' k graphlets, where graphlets are small connected non-isomorphic subgraphs of a large network. Our new measure of network local structure consists of 73 graphlet degree distributions of graphlets with 2-5 nodes, but it is easily extendible to a greater number of constraints (i.e. graphlets), if necessary, and the extensions are limited only by the available CPU. Furthermore, we show a way to combine the 73 graphlet degree distributions into a network 'agreement' measure which is a number between 0 and 1, where 1 means that networks have identical distributions and 0 means that they are far apart. Based on this new network agreement measure, we show that almost all of the 14 eukaryotic PPI networks, including human, resulting from various high-throughput experimental techniques, as well as from curated databases, are better modeled by geometric random graphs than by Erdös-Rény, random scale-free, or Barabási-Albert scale-free networks. Software executables are available upon request.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Biological network comparison using graphlet degree distribution

Abstract

Talk to us

Similar Papers

More From: Bioinformatics

Lead the way for us

Journal: Bioinformatics	Publication Date: Jan 15, 2007
Citations: 823

Similar Papers

Effects of local structure of neuronal networks on spiking activity in silico
Tuomo Mäki-Marttunen ... Marja-Leena Linne
BMC Neuroscience | VOL. 12
Tuomo Mäki-Marttunen, et. al.Tuomo Mäki-Marttunen ... Marja-Leena Linne
18 Jul 2011
BMC Neuroscience | VOL. 12

Modeling interactome: scale-free or geometric?
N Pržulj ... I Jurisica
Bioinformatics | VOL. 20
N Pržulj, et. al.N Pržulj ... I Jurisica
29 Jul 2004
Bioinformatics | VOL. 20

Efficient estimation of graphlet frequency distributions in protein–protein interaction networks
N Pržulj ... D G Corneil
Bioinformatics | VOL. 22
N Pržulj, et. al.N Pržulj ... D G Corneil
01 Feb 2006
Bioinformatics | VOL. 22

Generative Graph Models based on Laplacian Spectra?
Alana Shine ... David Kempe
-
Alana Shine, et. al.Alana Shine ... David Kempe
13 May 2019
13 May 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Biological network comparison using graphlet degree distribution

Abstract

Talk to us

Similar Papers

More From: Bioinformatics