Generation of gene-protein-reaction association rules in genome scale metabolic models: Chronology, challenges, and future perspectives
Generation of gene-protein-reaction association rules in genome scale metabolic models: Chronology, challenges, and future perspectives
- Research Article
2
- 10.4155/pbp.13.37
- Oct 1, 2013
- Pharmaceutical Bioprocessing
Biotechnology is currently evolving through the era of big data, thanks to advances in the high-throughput technologies for rapid and inexpensive genome sequencing and other genome-wide studies [1]. With the daunting amount of data, it has been possible to put them together into a coherently organized biological network that provides counterintuitive insights on biological systems [2]. Among such biological networks, a genome-scale metabolic network model is expected to play an increasingly important role in the biopharmaceutical industry [3]. Before enumerating their specific strengths, it is important to note that principles underlying genome-scale metabolic network models are consistent with the holistic perspective of systems biology, the aim of which is to unveil hidden factors causing diseases and to find relevant treatment strategies [4]. Despite the importance of metabolism in a biological system, studies on diseases in relation to metabolism were far fewer in number than those performed on signaling and transcriptional regulatory networks [5]. However, metabolism, highly linked with observable phenotypes, is a biological network that is more comprehensively characterized when compared with the other two types of networks [6]. Metabolism is, therefore, amenable to large-scale mathematical modeling and simulation. It is with this motivation that the genome-scale metabolic simulation deserves more attention in drug discovery campaigns and optimization of a host strain for the production of biopharmaceuticals. Reconstruction and application of genome-scale metabolic network models have been forged as a major research strategy of systems biology. Over the last decade, genome-scale metabolic models have been built for almost all biologically important organisms across the domains of archea, bacteria and eukaryotes [3]. They range from simple micro organisms such as Escherichia coli [7] and Saccharomyces cerevisiae [8] to higher organisms including Chinese hamster ovary (CHO) cells [9,10] and a generic human cell [11,12]. It should be noted that all these organisms that have been subjected to metabolic modeling are important cellular hosts for biopharmaceutical production or medically meaningful organisms that need to be cured (e.g., specific cancer cells) or destroyed (e.g., pathogens). A recent notable development of importance in the genomescale metabolic modeling would be the newly updated human metabolic network Recon 2 [12]. Recon 2 is a result of efforts from a group of researchers, going over a vast amount of literature and biochemical data and reconciling conflicting information. Scope of the hitherto reconstructed genome-scale metabolic models manifest high expectations for their potential contributions to biopharmaceutical industry. Genome-scale metabolic network models are not just a simple pileup of biochemical reactions, but allow mathematical simulation under precisely defined conditions of constraints [13]. Once the experimentally Applications of genome-scale metabolic network models in the biopharmaceutical industry
- Dissertation
- 10.18174/416473
- Jul 4, 2017
Metabolic modeling to understand and redesign microbial systems
- Research Article
32
- 10.1038/s41598-020-63235-w
- Apr 10, 2020
- Scientific Reports
Since the first in silico generation of a genome-scale metabolic (GSM) model for Haemophilus influenzae in 1999, the GSM models have been reconstructed for various organisms including human and mouse. There are two important strategies for generating a GSM model: in the bottom-up approach, individual genomic and biochemical components are integrated to build a GSM model. Alternatively, the orthology-based strategy uses a previously reconstructed model of a reference organism to infer a GSM model of a target organism. Following the update and development of the metabolic network of reference organism, the model of the target organism can also be updated to eliminate defects. Here, we presented iMM1865 model as an orthology-based reconstruction of a GSM model for Mus musculus based on the last flux-consistent version of the human metabolic network, Recon3D. We proposed two versions of the new mouse model, iMM1865 and min-iMM1865, with the same number of gene-associated reactions but different subsets of non-gene-associated reactions. A third extended but flux-inconsistent model (iMM3254) was also created based on the extended version of Recon3D. Compared to the previously published mouse models, both versions of iMM1865 include more comprehensive annotations of metabolites and reactions with no dead-end metabolites and blocked reactions. We evaluated functionality of the models using 431 metabolic objective functions. iMM1865 and min-iMM1865 passed 93% and 87% of the tests, respectively, while iMM1415 and MMR (another available mouse GSM) passed 80% and 84% of the tests, respectively. Three versions of tissue-specific embryo heart models were also reconstructed from each of iMM1865 and min-iMM1865 using mCADRE algorithm with different thresholds on expression-based scores. The ability of corresponding GSM and embryo heart models to predict essential genes was assessed across experimentally derived lethal and viable gene sets. Our analysis revealed that tissue-specific models render much better predictions than GSM models.
- Research Article
10
- 10.1016/j.compbiomed.2023.106600
- Jan 25, 2023
- Computers in Biology and Medicine
Genome-scale community modeling for deciphering the inter-microbial metabolic interactions in fungus-farming termite gut microbiome
- Research Article
85
- 10.1016/j.ymben.2015.08.006
- Sep 8, 2015
- Metabolic Engineering
13C metabolic flux analysis at a genome-scale
- Research Article
10
- 10.1007/s43393-022-00115-6
- Jul 21, 2022
- Systems Microbiology and Biomanufacturing
Due to the increasing demand for microbially manufactured products in various industries, it has become important to find optimal designs for microbial cell factories by changing the direction of metabolic flow and its flux size by means of metabolic engineering such as knocking out competing pathways and introducing exogenous pathways to increase the yield of desired products. Recently, with the gradual cross-fertilization between computer science and bioinformatics fields, machine learning and intelligent optimization-based approaches have received much attention in Genome-scale metabolic network models (GSMMs) based on constrained optimization methods, and many high-quality related works have been published. Therefore, this paper focuses on the advances and applications of machine learning and intelligent optimization algorithms in metabolic engineering, with special emphasis on GSMMs. Specifically, the development history of GSMMs is first reviewed. Then, the analysis methods of GSMMs based on constraint optimization are presented. Next, this paper mainly reviews the development and application of machine learning and intelligent optimization algorithms in genome-scale metabolic models. In addition, the research gaps and future research potential in machine learning and intelligent optimization methods applied in GSMMs are discussed.
- Research Article
- 10.1109/tcbbio.2025.3590141
- Jan 1, 2025
- IEEE transactions on computational biology and bioinformatics
In genome-scale constraint-based metabolic models, gene deletion strategies are crucial for achieving growth-coupled production, where cell growth and target metabolite production are simultaneously achieved. While computational methods for calculating gene deletions have been widely explored and have contributed to gene deletion strategy databases, current approaches remain computationally demanding and have yet to fully leverage emerging data-driven paradigms, such as machine learning, for more efficient strain design. Therefore, it is necessary to propose a fundamental framework for this objective. In this study, we first formulate the problem of gene deletion strategy prediction and then propose a framework for predicting gene deletion strategies for growth-coupled production in genome-scale metabolic models. The proposed framework leverages deep learning algorithms to learn and integrate sequential gene and metabolite data representation, enabling the automatic gene deletion strategy prediction. Computational experiment results demonstrate the feasibility of the proposed framework, showing substantial improvements over baseline methods. Specifically, the proposed framework achieves a 14.69%, 22.52%, and 13.03% increase in overall accuracy across three metabolic models of different scales under study, while maintaining balanced precision and recall in predicting gene deletion statuses.
- Research Article
2
- 10.1016/j.bej.2023.108947
- Apr 25, 2023
- Biochemical Engineering Journal
Reconstruction and analysis of a genome-scale metabolic model for the gut bacteria Prevotella copri
- Research Article
20
- 10.1016/j.jbiotec.2017.04.004
- Apr 4, 2017
- Journal of Biotechnology
Genome-scale metabolic modelling common cofactors metabolism in microorganisms
- Book Chapter
3
- 10.1007/978-3-319-92958-3_4
- Jan 1, 2018
Genome-scale metabolic (GSM) models of plants have flourished over the last decade with advancements in both their scope and their applications. While the first plant models were mainly developed to represent a comprehensive set of all metabolic reactions that occur within a plant, organ- and tissue-specific models have been developed and recently these models have been linked to create whole-plant metabolic models. GSM models provide a promising path to predict the effect of genetic and environmental perturbations on metabolism. These models capture the interplay between carbon and nitrogen metabolism, which is important in designing genetic manipulations that improve nitrogen use efficiency (NUE). There is also potential to apply and adapt the diverse set of algorithms developed for microbial GSM models to plants. These algorithms have yielded numerous success stories in predicting the metabolic effects of genetic manipulations by identifying strategies for over-producing chemicals, and driving discovery. Furthering the development of plant GSM models and associated algorithmic tools is expected to have a large impact on predicting manipulations to improve plant traits such as NUE.
- Research Article
14
- 10.1007/s12257-019-0208-1
- Feb 1, 2020
- Biotechnology and Bioprocess Engineering
Genome-scale metabolic models (GEMs) are powerful tools for predicting metabolic flux distributions, understanding complex cell physiology, and guiding the improvement of cell metabolism and production. Yarrowia lipolytica is known for its ability to accumulate lipids and has been widely employed to produce many important metabolites as an ideal host microorganism. There are six GEMs reconstructed for this strain by different research groups, which, however, may cause confusion for model users. It is therefore necessary to understand and analyze the existing models comprehensively. Different simulation results of the published GEMs of Y. lipolytica were analyzed based on experimental data, in order to understand the differences among models and identify whether there were common problems in model construction. First, specific growth rates (μ) under various culture conditions were simulated by different models, showing that the biomass generation equation in models had significant influence on the accuracy of simulation results. In addition, simulation and analysis of intracellular flux distributions revealed several inaccurate descriptions on the reversibility of reactions involving currency metabolites in the models. Finally, specific metabolite formation rates were predicted for different target products, and large discrepancies among the different models were observed. The corresponding solutions were then proposed according to the findings of the above model problems. We have corrected the existing GEMs of Y. lipolytica and the prediction performances of the models have been significantly improved. Several suggestions for better construction and refinement of genome-scale metabolic network models were also provided.
- Book Chapter
54
- 10.1016/b978-0-12-385118-5.00024-4
- Jan 1, 2011
- Methods in Enzymology
Chapter twenty-four - A Practical Guide to Genome-Scale Metabolic Models and Their Analysis
- Research Article
20
- 10.1186/s12918-017-0434-0
- Jun 1, 2017
- BMC systems biology
BackgroundThe increase in glycerol obtained as a byproduct of biodiesel has encouraged the production of new industrial products, such as 1,3-propanediol (PDO), using biotechnological transformation via bacteria like Clostridium butyricum. However, despite the increasing role of Clostridium butyricum as a bio-production platform, its metabolism remains poorly modeled.ResultsWe reconstructed iCbu641, the first genome-scale metabolic (GSM) model of a PDO producer Clostridium strain, which included 641 genes, 365 enzymes, 891 reactions, and 701 metabolites. We found an enzyme expression prediction of nearly 84% after comparison of proteomic data with flux distribution estimation using flux balance analysis (FBA). The remaining 16% corresponded to enzymes directionally coupled to growth, according to flux coupling findings (FCF). The fermentation data validation also revealed different phenotype states that depended on culture media conditions; for example, Clostridium maximizes its biomass yield per enzyme usage under glycerol limitation. By contrast, under glycerol excess conditions, Clostridium grows sub-optimally, maximizing biomass yield while minimizing both enzyme usage and ATP production. We further evaluated perturbations in the GSM model through enzyme deletions and variations in biomass composition. The GSM predictions showed no significant increase in PDO production, suggesting a robustness to perturbations in the GSM model. We used the experimental results to predict that co-fermentation was a better alternative than iCbu641 perturbations for improving PDO yields.ConclusionsThe agreement between the predicted and experimental values allows the use of the GSM model constructed for the PDO-producing Clostridium strain to propose new scenarios for PDO production, such as dynamic simulations, thereby reducing the time and costs associated with experimentation.
- Research Article
33
- 10.1016/j.ymben.2021.06.005
- Jun 24, 2021
- Metabolic Engineering
Integrating thermodynamic and enzymatic constraints into genome-scale metabolic models
- Research Article
- 10.13345/j.cjb.210335
- Feb 25, 2022
- Sheng wu gong cheng xue bao = Chinese journal of biotechnology
Constraint-based genome-scale metabolic network models (genome-scale metabolic models, GEMs) have been widely used to predict metabolic phenotypes. In addition to stoichiometric constraints, other constraints such as enzyme availability and thermodynamic feasibility may also limit the cellular phenotype solution space. Recently, extended GEM models considering either enzymatic or thermodynamic constraints have been developed to improve model prediction accuracy. This review summarizes the recent progresses on metabolic models with multiple constraints (MCGEMs). We presented the construction methods and various applications of MCGEMs including the simulation of gene knockout, prediction of biologically feasible pathways and identification of bottleneck steps. By integrating multiple constraints in a consistent modeling framework, MCGEMs can predict the metabolic bottlenecks and key controlling and modification targets for pathway optimization more precisely, and thus may provide more reliable design results to guide metabolic engineering of industrially important microorganisms.
- Ask R Discovery
- Chat PDF
AI summaries and top papers from 250M+ research sources.