Protein Interaction Graphs Research Articles

Proteins are vital biological molecules driving many fundamental cellular processes. They rarely act alone, but form interacting groups called protein complexes. The study of protein complexes is a key goal in systems biology. Recently, large protein-protein interaction (PPI) datasets have been published and a plethora of computational methods that provide new ideas for the prediction of protein complexes have been implemented. However, most of the methods suffer from two major limitations: First, they do not account for proteins participating in multiple functions and second, they are unable to handle weighted PPI graphs. Moreover, the problem remains open as existing algorithms and tools are insufficient in terms of predictive metrics. In the present paper, we propose gradually expanding neighborhoods with adjustment (GENA), a new algorithm that gradually expands neighborhoods in a graph starting from highly informative "seed" nodes. GENA considers proteins as multifunctional molecules allowing them to participate in more than one protein complex. In addition, GENA accepts weighted PPI graphs by using a weighted evaluation function for each cluster. In experiments with datasets from Saccharomyces cerevisiae and human, GENA outperformed Markov clustering, restricted neighborhood search and clustering with overlapping neighborhood expansion, three state-of-the-art methods for computationally predicting protein complexes. Seven PPI networks and seven evaluation datasets were used in total. GENA outperformed existing methods in 16 out of 18 experiments achieving an average improvement of 5.5% when the maximum matching ratio metric was used. Our method was able to discover functionally homogeneous protein clusters and uncover important network modules in a Parkinson expression dataset. When used on the human networks, around 47% of the detected clusters were enriched in gene ontology (GO) terms with depth higher than five in the GO hierarchy. 
In the present manuscript, we introduce a new method for the computational prediction of protein complexes by making the realistic assumption that proteins participate in multiple protein complexes and cellular functions. Our method can detect accurate and functionally homogeneous clusters.
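The general idea of growing clusters outward from seed nodes in a weighted graph can be illustrated with a small sketch. This is not the GENA algorithm itself. The cohesion score (internal weight divided by internal plus boundary weight), the seed ordering by weighted degree, the toy edge list, and all function names are assumptions chosen for illustration only.

```python
def build_graph(edges):
    """Build an undirected weighted graph as a dict of dicts."""
    graph = {}
    for u, v, w in edges:
        graph.setdefault(u, {})[v] = w
        graph.setdefault(v, {})[u] = w
    return graph


def cohesion(graph, cluster):
    """Assumed score: internal weight / (internal + boundary weight)."""
    inside = 0.0
    boundary = 0.0
    for u in cluster:
        for v, w in graph[u].items():
            if v in cluster:
                inside += w
            else:
                boundary += w
    inside /= 2.0  # every internal edge was visited from both endpoints
    total = inside + boundary
    return inside / total if total > 0 else 0.0


def expand_from_seed(graph, seed):
    """Greedily add the neighbor that most improves cohesion."""
    cluster = {seed}
    score = cohesion(graph, cluster)
    while True:
        frontier = {v for u in cluster for v in graph[u]} - cluster
        best_node, best_score = None, score
        for v in sorted(frontier):  # sorted for deterministic tie-breaking
            candidate = cohesion(graph, cluster | {v})
            if candidate > best_score:
                best_node, best_score = v, candidate
        if best_node is None:
            return cluster  # no neighbor improves the score
        cluster.add(best_node)
        score = best_score


def find_clusters(graph):
    """Expand from seeds in order of decreasing weighted degree.

    Only seeds are required to be uncovered; expansion may reuse nodes
    that already belong to another cluster, so clusters can overlap.
    """
    strength = {u: sum(nbrs.values()) for u, nbrs in graph.items()}
    covered = set()
    clusters = []
    for seed in sorted(graph, key=lambda u: (-strength[u], u)):
        if seed in covered:
            continue
        cluster = expand_from_seed(graph, seed)
        covered |= cluster
        if len(cluster) > 1 and cluster not in clusters:
            clusters.append(cluster)
    return clusters


# Hypothetical toy network: two tight triangles joined by one weak edge.
edges = [
    ("A", "B", 1.0), ("A", "C", 1.0), ("B", "C", 1.0),
    ("D", "E", 1.0), ("D", "F", 1.0), ("E", "F", 1.0),
    ("C", "D", 0.25),
]
graph = build_graph(edges)
clusters = find_clusters(graph)
```

On this toy network the weak C-D edge is never worth absorbing, so each triangle is returned as its own cluster. A real method would also need a rule for merging or filtering heavily overlapping clusters, which this sketch leaves out.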

Background
Genome-scale data on protein interactions are generally represented as large networks, or graphs, where hundreds or thousands of proteins are linked to one another. Since proteins tend to function in groups, or complexes, an important goal has been to reliably identify protein complexes from these graphs. This task is commonly executed using clustering procedures, which aim at detecting densely connected regions within the interaction graphs. There exists a wealth of clustering algorithms, some of which have been applied to this problem. One of the most successful clustering procedures in this context has been the Markov Cluster algorithm (MCL), which was recently shown to outperform a number of other procedures, some of which were specifically designed for partitioning protein interaction graphs. A novel promising clustering procedure termed Affinity Propagation (AP) was recently shown to be particularly effective, and much faster than other methods for a variety of problems, but has not yet been applied to partition protein interaction graphs.

Results
In this work we compare the performance of the Affinity Propagation (AP) and Markov Clustering (MCL) procedures. To this end we derive an unweighted network of protein-protein interactions from a set of 408 protein complexes from S. cerevisiae, hand-curated in-house, and evaluate the performance of the two clustering algorithms in recalling the annotated complexes. In doing so, the parameter space of each algorithm is sampled in order to select optimal values for these parameters, and the robustness of the algorithms is assessed by quantifying the level of complex recall as interactions are randomly added to or removed from the network to simulate noise. To evaluate the performance on a weighted protein interaction graph, we also apply the two algorithms to the consolidated protein interaction network of S. cerevisiae, derived from genome-scale purification experiments, and to versions of this network in which varying proportions of the links have been randomly shuffled.

Conclusion
Our analysis shows that the MCL procedure is significantly more tolerant to noise and behaves more robustly than the AP algorithm. The advantage of MCL over AP is dramatic for unweighted protein interaction graphs, as AP displays severe convergence problems on the majority of the unweighted graph versions that we tested, whereas MCL continues to identify meaningful clusters, albeit fewer of them, as the level of noise in the graph increases. MCL thus remains the method of choice for identifying protein complexes from binary interaction networks.
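For readers unfamiliar with the Markov Cluster procedure discussed above, its core loop alternates an expansion step (squaring a column-stochastic matrix) with an inflation step (raising entries to a power and renormalizing columns). The following is a minimal pure-Python sketch of that loop, not the reference MCL implementation. The function names, the inflation value of 2.0, the iteration cap, the thresholds, and the toy graphs are all assumptions.

```python
def normalize_columns(m):
    """Scale each column in place so that it sums to 1."""
    n = len(m)
    for j in range(n):
        s = sum(m[i][j] for i in range(n))
        if s > 0:
            for i in range(n):
                m[i][j] /= s
    return m


def matmul(a, b):
    """Plain dense product of two square matrices."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def mcl(adjacency, inflation=2.0, iterations=50, tol=1e-9):
    """Sketch of Markov clustering on a dense adjacency matrix."""
    n = len(adjacency)
    # Add self-loops, then make the matrix column-stochastic.
    m = [[float(adjacency[i][j]) + (1.0 if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    normalize_columns(m)
    for _ in range(iterations):
        expanded = matmul(m, m)                                   # expansion
        inflated = [[x ** inflation for x in row] for row in expanded]
        normalize_columns(inflated)                               # inflation
        delta = max(abs(inflated[i][j] - m[i][j])
                    for i in range(n) for j in range(n))
        m = inflated
        if delta < tol:
            break
    # Each row with non-negligible entries defines one cluster.
    clusters = []
    for i in range(n):
        members = frozenset(j for j in range(n) if m[i][j] > 1e-6)
        if members and members not in clusters:
            clusters.append(members)
    return sorted(sorted(c) for c in clusters)


# Hypothetical toy graph: two disconnected triangles (nodes 0-2 and 3-5).
two_triangles = [
    [0, 1, 1, 0, 0, 0],
    [1, 0, 1, 0, 0, 0],
    [1, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 1],
    [0, 0, 0, 1, 0, 1],
    [0, 0, 0, 1, 1, 0],
]
clusters = mcl(two_triangles)

# Same graph with one bridging edge between nodes 2 and 3.
bridged = [row[:] for row in two_triangles]
bridged[2][3] = bridged[3][2] = 1
bridged_clusters = mcl(bridged)
```

For the disconnected triangles, flow can never cross between components, so each triangle forms its own cluster. Higher inflation values generally yield finer clusterings, which is why the parameter is worth sampling when comparing against annotated complexes.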

Articles published on Protein Interaction Graphs

ArcMatch: high-performance subgraph matching for labeled graphs by exploiting edge domains

PPICT: an integrated deep neural network for predicting inter-protein PTM cross-talk.

Building Protein-Protein Interaction Graph Database Using Neo4j.

Peripheral blood exosomes from patients with multiple myeloma mediate bortezomib resistance in cultured multiple myeloma cells

A network-based zoning for parallel whole-cell simulation.

Predicting overlapping protein complexes from weighted protein interaction graphs by gradually expanding dense neighborhoods.

Large-scale identification of potential drug targets based on the topological features of human protein–protein interaction network

Finding Bicliques in Digraphs: Application into Viral-host Protein Interactome

Using a Genetic Algorithm and Markov Clustering on Protein–Protein Interaction Graphs

Spa: A Semi-Supervised R Package for Semi-Parametric Graph-Based Estimation

Discovering pathways by orienting edges in protein interaction networks

Computational approaches for detecting protein complexes from protein interaction networks: a survey

Complexity issues in color-preserving graph embeddings

Markov clustering versus affinity propagation for the partitioning of protein interaction graphs

Revealing Biological Modules via Graph Summarization

Finding occurrences of protein complexes in protein–protein interaction graphs

Protein complex identification by supervised graph local clustering

Integrating protein-protein interactions and text mining for protein function prediction

Evolution of insect proteomes: insights into synapse organization and synaptic vesicle life cycle
