Abstract

Accurate modelling of biological systems requires a deeper and more complete knowledge about the molecular components and their functional associations than we currently have. Traditionally, new knowledge on protein associations generated by experiments has played a central role in systems modelling, in contrast to generally less trusted bio-computational predictions. However, we will not achieve realistic modelling of complex molecular systems if the current experimental designs lead to biased screenings of real protein networks and leave large, functionally important areas poorly characterised. To assess the likelihood of this, we have built comprehensive network models of the yeast and human proteomes by using a meta-statistical integration of diverse computationally predicted protein association datasets. We have compared these predicted networks against combined experimental datasets from seven biological resources at different level of statistical significance. These eukaryotic predicted networks resemble all the topological and noise features of the experimentally inferred networks in both species, and we also show that this observation is not due to random behaviour. In addition, the topology of the predicted networks contains information on true protein associations, beyond the constitutive first order binary predictions. We also observe that most of the reliable predicted protein associations are experimentally uncharacterised in our models, constituting the hidden or “dark matter” of networks by analogy to astronomical systems. Some of this dark matter shows enrichment of particular functions and contains key functional elements of protein networks, such as hubs associated with important functional areas like the regulation of Ras protein signal transduction in human cells. Thus, characterising this large and functionally important dark matter, elusive to established experimental designs, may be crucial for modelling biological systems. In any case, these predictions provide a valuable guide to these experimentally elusive regions.

Highlights

  • Many features of biological systems cannot be inferred from a simple sum of their components but rather emerge as network properties [1]

  • To model accurate protein networks we need to extend our knowledge of protein associations in molecular systems much further

  • We suggest that the majority of predicted network space is dark matter containing important functional areas, elusive to current experimental designs

Read more

Summary

Introduction

Many features of biological systems cannot be inferred from a simple sum of their components but rather emerge as network properties [1]. Organisms comprise systems of highly integrated networks or ‘accelerating networks’ [2] in which all components (proteins, lipids, minerals, water, etc.) are integrated and coordinated in time and space. Given such complexity, the gaps in our current knowledge prevent us from modelling complete living organisms [3,4]. The scarce knowledge of biological systems is further compounded by experimental error It is common for different highthroughput experimental approaches, applied to the same biological system, to yield different outcomes, resulting in protein networks with different topological and biological properties [4]. It has been demonstrated that high-throughput yeast two-hybrid (HT-Y2H) interactions for human proteins are more precise than literature-curated interactions supported by a single publication [6]

Objectives
Methods
Results
Discussion
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.