Abstract

Metabolic network models are increasingly being used in health care and industry. As a consequence, many tools have been released to automate their reconstruction process de novo. In order to enable gene deletion simulations and integration of gene expression data, these networks must include gene-protein-reaction (GPR) rules, which describe with a Boolean logic relationships between the gene products (e.g., enzyme isoforms or subunits) associated with the catalysis of a given reaction. Nevertheless, the reconstruction of GPRs still remains a largely manual and time consuming process. Aiming at fully automating the reconstruction process of GPRs for any organism, we propose the open-source python-based framework GPRuler. By mining text and data from 9 different biological databases, GPRuler can reconstruct GPRs starting either from just the name of the target organism or from an existing metabolic model. The performance of the developed tool is evaluated at small-scale level for a manually curated metabolic model, and at genome-scale level for three metabolic models related to Homo sapiens and Saccharomyces cerevisiae organisms. By exploiting these models as benchmarks, the proposed tool shown its ability to reproduce the original GPR rules with a high level of accuracy. In all the tested scenarios, after a manual investigation of the mismatches between the rules proposed by GPRuler and the original ones, the proposed approach revealed to be in many cases more accurate than the original models. By complementing existing tools for metabolic network reconstruction with the possibility to reconstruct GPRs quickly and with a few resources, GPRuler paves the way to the study of context-specific metabolic networks, representing the active portion of the complete network in given conditions, for organisms of industrial or biomedical interest that have not been characterized metabolically yet.

Highlights

  • Current advances in genome sequencing technologies enable a fast and cheap overview into the genetic composition of virtually any organism

  • According to the type of relationships established among the genes involved in a given reaction, GPR rules can be categorized into five classes

  • We firstly assessed the performance of GPRuler in reconstructing the GPR rules of HMRcore model, which is a core model of central carbon metabolism that we extracted from the genome-wide HMR metabolic model [43] and we subsequently introduced and curated in [36, 44,45,46]

Read more

Summary

Introduction

Current advances in genome sequencing technologies enable a fast and cheap overview into the genetic composition of virtually any organism. Determining the global metabolic profile of a cell or organism is fundamental to provide a comprehensive readout of its functional state, resulting from the interplay between genome, biochemistry and environment. In this context, genome-scale metabolic models (GEMs) offer a systemic overview for the investigation of cell metabolic potential, because of their key feature of embracing all available knowledge about the biochemical transformations taking place in a given cell or organism [1]. Metabolic reactions occur either spontaneously or catalyzed by small molecules, implying that no gene is necessary for their catalysis. Enzymes differing in either their biological activity, regulatory properties, intracellular location, or spatio-temporal expression, may alternatively catalyze the same reaction and are known as isoforms [3]

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call