Creating and analyzing pathway and protein interaction compendia for modelling signal transduction networks

Daniel C Kirouac,Peter K Sorger,Douglas A Lauffenburger,Jennifer Swantek,Julio Saez-Rodriguez,John M Burke

doi:10.1186/1752-0509-6-29

Abstract

BackgroundUnderstanding the information-processing capabilities of signal transduction networks, how those networks are disrupted in disease, and rationally designing therapies to manipulate diseased states require systematic and accurate reconstruction of network topology. Data on networks central to human physiology, such as the inflammatory signalling networks analyzed here, are found in a multiplicity of on-line resources of pathway and interactome databases (Cancer CellMap, GeneGo, KEGG, NCI-Pathway Interactome Database (NCI-PID), PANTHER, Reactome, I2D, and STRING). We sought to determine whether these databases contain overlapping information and whether they can be used to construct high reliability prior knowledge networks for subsequent modeling of experimental data.ResultsWe have assembled an ensemble network from multiple on-line sources representing a significant portion of all machine-readable and reconcilable human knowledge on proteins and protein interactions involved in inflammation. This ensemble network has many features expected of complex signalling networks assembled from high-throughput data: a power law distribution of both node degree and edge annotations, and topological features of a “bow tie” architecture in which diverse pathways converge on a highly conserved set of enzymatic cascades focused around PI3K/AKT, MAPK/ERK, JAK/STAT, NFκB, and apoptotic signaling. Individual pathways exhibit “fuzzy” modularity that is statistically significant but still involving a majority of “cross-talk” interactions. However, we find that the most widely used pathway databases are highly inconsistent with respect to the actual constituents and interactions in this network. Using a set of growth factor signalling networks as examples (epidermal growth factor, transforming growth factor-beta, tumor necrosis factor, and wingless), we find a multiplicity of network topologies in which receptors couple to downstream components through myriad alternate paths. Many of these paths are inconsistent with well-established mechanistic features of signalling networks, such as a requirement for a transmembrane receptor in sensing extracellular ligands.ConclusionsWide inconsistencies among interaction databases, pathway annotations, and the numbers and identities of nodes associated with a given pathway pose a major challenge for deriving causal and mechanistic insight from network graphs. We speculate that these inconsistencies are at least partially attributable to cell, and context-specificity of cellular signal transduction, which is largely unaccounted for in available databases, but the absence of standardized vocabularies is an additional confounding factor. As a result of discrepant annotations, it is very difficult to identify biologically meaningful pathways from interactome networks a priori. However, by incorporating prior knowledge, it is possible to successively build out network complexity with high confidence from a simple linear signal transduction scaffold. Such reduced complexity networks appear suitable for use in mechanistic models while being richer and better justified than the simple linear pathways usually depicted in diagrams of signal transduction.

Highlights

Understanding the information-processing capabilities of signal transduction networks, how those networks are disrupted in disease, and rationally designing therapies to manipulate diseased states require systematic and accurate reconstruction of network topology
We identified seven interactome databases, partially overlapping with the pathway databases, for which it was possible to extract machine readable interactions in Cytoscape’s Simple Interaction Format (SIF) [27] or analogous tabular formats: a meta-database of proteinprotein interactions (PPI) (Interologous Interaction Database; I2D) [28], an integrated text-mining meta-database (STRING) [29] and five of the expert-curated databases listed above (Cancer Cell Map (CellMap), GeneGo, National Cancer Institute Pathway Interactome Database (NCI-PID), Reactome, and Macrophage)
While many databases contain representations of “canonical” signalling pathways, it is not clear how consistent the definition of pathways is. To examine this we focused on 4 extensively studied, and presumably well-defined signalling pathways lying downstream of Epidermal Growth Factor (EGF), Transforming Growth Factor-β (TGF-β), Tumour Necrosis Factor-α (TNF-α), and Wingless (Wnt; Figure 3BE)

Summary

Introduction

Understanding the information-processing capabilities of signal transduction networks, how those networks are disrupted in disease, and rationally designing therapies to manipulate diseased states require systematic and accurate reconstruction of network topology. The information in Bayesian nets or graphs assembled using mutual information, regression or physical association is almost entirely topological Such models capture sets of interactions involving hundreds or thousands of biomolecules and can reveal how disease processes affect large sets of molecular [12] and cellular interactions [13]. They typically convert interaction networks into computable models and train the models against experimental data [9,14,15,16] Based on these models, it seems likely large-scale interaction databases represent the totality of all possible interactions that might occur between biomolecules, ignoring important cell- and context-specific differences. Manual approaches are biased and excessively restrictive in terms of the numbers of nodes and interactions and automated approaches to PKN assembly are clearly required

Objectives

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Systems Biology	Publication Date: May 1, 2012
Citations: 135	License type: cc-by

R Discovery Prime

R Discovery Prime

Creating and analyzing pathway and protein interaction compendia for modelling signal transduction networks

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Systems Biology

Lead the way for us

Similar Papers

Proteome-wide Prediction of Signal Flow Direction in Protein Interaction Networks Based on Interacting Domains
Wei Liu ... Fuchu He
Molecular & Cellular Proteomics | VOL. 8
Wei Liu, et. al.Wei Liu ... Fuchu He
01 Sep 2009
Molecular & Cellular Proteomics | VOL. 8

Keratinocyte Growth Inhibition by High-Dose Epidermal Growth Factor Is Mediated by Transforming Growth Factor β Autoinduction: A Negative Feedback Mechanism for Keratinocyte Growth
Kenshi Yamasaki ... Koji Hashimoto
Journal of Investigative Dermatology | VOL. 120
Kenshi Yamasaki, et. al.Kenshi Yamasaki ... Koji Hashimoto
01 Jun 2003
Journal of Investigative Dermatology | VOL. 120

Postnatal skeletal muscle myogenesis governed by signal transduction networks: MAPKs and PI3K–Akt control multiple steps
Takeshi Endo
Biochemical and Biophysical Research Communications | VOL. 682
Takeshi EndoTakeshi Endo
27 Sep 2023
Biochemical and Biophysical Research Communications | VOL. 682

Network quantification of EGFR signaling unveils potential for targeted combination therapy
Bertram Klinger ... Anja Sieber
Molecular Systems Biology | VOL. 9
Bertram Klinger, et. al.Bertram Klinger ... Anja Sieber
01 Jan 2013
Molecular Systems Biology | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Creating and analyzing pathway and protein interaction compendia for modelling signal transduction networks

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Systems Biology