Abstract

BackgroundUnderstanding the information-processing capabilities of signal transduction networks, how those networks are disrupted in disease, and rationally designing therapies to manipulate diseased states require systematic and accurate reconstruction of network topology. Data on networks central to human physiology, such as the inflammatory signalling networks analyzed here, are found in a multiplicity of on-line resources of pathway and interactome databases (Cancer CellMap, GeneGo, KEGG, NCI-Pathway Interactome Database (NCI-PID), PANTHER, Reactome, I2D, and STRING). We sought to determine whether these databases contain overlapping information and whether they can be used to construct high reliability prior knowledge networks for subsequent modeling of experimental data.ResultsWe have assembled an ensemble network from multiple on-line sources representing a significant portion of all machine-readable and reconcilable human knowledge on proteins and protein interactions involved in inflammation. This ensemble network has many features expected of complex signalling networks assembled from high-throughput data: a power law distribution of both node degree and edge annotations, and topological features of a “bow tie” architecture in which diverse pathways converge on a highly conserved set of enzymatic cascades focused around PI3K/AKT, MAPK/ERK, JAK/STAT, NFκB, and apoptotic signaling. Individual pathways exhibit “fuzzy” modularity that is statistically significant but still involving a majority of “cross-talk” interactions. However, we find that the most widely used pathway databases are highly inconsistent with respect to the actual constituents and interactions in this network. Using a set of growth factor signalling networks as examples (epidermal growth factor, transforming growth factor-beta, tumor necrosis factor, and wingless), we find a multiplicity of network topologies in which receptors couple to downstream components through myriad alternate paths. Many of these paths are inconsistent with well-established mechanistic features of signalling networks, such as a requirement for a transmembrane receptor in sensing extracellular ligands.ConclusionsWide inconsistencies among interaction databases, pathway annotations, and the numbers and identities of nodes associated with a given pathway pose a major challenge for deriving causal and mechanistic insight from network graphs. We speculate that these inconsistencies are at least partially attributable to cell, and context-specificity of cellular signal transduction, which is largely unaccounted for in available databases, but the absence of standardized vocabularies is an additional confounding factor. As a result of discrepant annotations, it is very difficult to identify biologically meaningful pathways from interactome networks a priori. However, by incorporating prior knowledge, it is possible to successively build out network complexity with high confidence from a simple linear signal transduction scaffold. Such reduced complexity networks appear suitable for use in mechanistic models while being richer and better justified than the simple linear pathways usually depicted in diagrams of signal transduction.

Highlights

  • Understanding the information-processing capabilities of signal transduction networks, how those networks are disrupted in disease, and rationally designing therapies to manipulate diseased states require systematic and accurate reconstruction of network topology

  • We identified seven interactome databases, partially overlapping with the pathway databases, for which it was possible to extract machine readable interactions in Cytoscape’s Simple Interaction Format (SIF) [27] or analogous tabular formats: a meta-database of proteinprotein interactions (PPI) (Interologous Interaction Database; I2D) [28], an integrated text-mining meta-database (STRING) [29] and five of the expert-curated databases listed above (Cancer Cell Map (CellMap), GeneGo, National Cancer Institute Pathway Interactome Database (NCI-PID), Reactome, and Macrophage)

  • While many databases contain representations of “canonical” signalling pathways, it is not clear how consistent the definition of pathways is. To examine this we focused on 4 extensively studied, and presumably well-defined signalling pathways lying downstream of Epidermal Growth Factor (EGF), Transforming Growth Factor-β (TGF-β), Tumour Necrosis Factor-α (TNF-α), and Wingless (Wnt; Figure 3BE)

Read more

Summary

Introduction

Understanding the information-processing capabilities of signal transduction networks, how those networks are disrupted in disease, and rationally designing therapies to manipulate diseased states require systematic and accurate reconstruction of network topology. The information in Bayesian nets or graphs assembled using mutual information, regression or physical association is almost entirely topological Such models capture sets of interactions involving hundreds or thousands of biomolecules and can reveal how disease processes affect large sets of molecular [12] and cellular interactions [13]. They typically convert interaction networks into computable models and train the models against experimental data [9,14,15,16] Based on these models, it seems likely large-scale interaction databases represent the totality of all possible interactions that might occur between biomolecules, ignoring important cell- and context-specific differences. Manual approaches are biased and excessively restrictive in terms of the numbers of nodes and interactions and automated approaches to PKN assembly are clearly required

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.