Applications of genome-scale metabolic network models in the biopharmaceutical industry
Biotechnology is currently evolving through the era of big data, thanks to advances in the high-throughput technologies for rapid and inexpensive genome sequencing and other genome-wide studies [1]. With the daunting amount of data, it has been possible to put them together into a coherently organized biological network that provides counterintuitive insights on biological systems [2]. Among such biological networks, a genome-scale metabolic network model is expected to play an increasingly important role in the biopharmaceutical industry [3]. Before enumerating their specific strengths, it is important to note that principles underlying genome-scale metabolic network models are consistent with the holistic perspective of systems biology, the aim of which is to unveil hidden factors causing diseases and to find relevant treatment strategies [4]. Despite the importance of metabolism in a biological system, studies on diseases in relation to metabolism were far fewer in number than those performed on signaling and transcriptional regulatory networks [5]. However, metabolism, highly linked with observable phenotypes, is a biological network that is more comprehensively characterized when compared with the other two types of networks [6]. Metabolism is, therefore, amenable to large-scale mathematical modeling and simulation. It is with this motivation that the genome-scale metabolic simulation deserves more attention in drug discovery campaigns and optimization of a host strain for the production of biopharmaceuticals. Reconstruction and application of genome-scale metabolic network models have been forged as a major research strategy of systems biology. Over the last decade, genome-scale metabolic models have been built for almost all biologically important organisms across the domains of archea, bacteria and eukaryotes [3]. They range from simple micro organisms such as Escherichia coli [7] and Saccharomyces cerevisiae [8] to higher organisms including Chinese hamster ovary (CHO) cells [9,10] and a generic human cell [11,12]. It should be noted that all these organisms that have been subjected to metabolic modeling are important cellular hosts for biopharmaceutical production or medically meaningful organisms that need to be cured (e.g., specific cancer cells) or destroyed (e.g., pathogens). A recent notable development of importance in the genomescale metabolic modeling would be the newly updated human metabolic network Recon 2 [12]. Recon 2 is a result of efforts from a group of researchers, going over a vast amount of literature and biochemical data and reconciling conflicting information. Scope of the hitherto reconstructed genome-scale metabolic models manifest high expectations for their potential contributions to biopharmaceutical industry. Genome-scale metabolic network models are not just a simple pileup of biochemical reactions, but allow mathematical simulation under precisely defined conditions of constraints [13]. Once the experimentally Applications of genome-scale metabolic network models in the biopharmaceutical industry
1659
- 10.1038/nprot.2009.203
- Jan 1, 2010
- Nature Protocols
115
- 10.1073/pnas.0909181107
- Jan 5, 2010
- Proceedings of the National Academy of Sciences
85
- 10.1371/annotation/5594348b-de00-446a-bdd0-ec56e70b3553
- Jun 10, 2008
- PLoS Computational Biology
3592
- 10.1038/nbt.1614
- Mar 1, 2010
- Nature Biotechnology
19
- 10.1038/msb.2011.63
- Jan 1, 2011
- Molecular Systems Biology
253
- 10.3389/fphys.2012.00299
- Aug 6, 2012
- Frontiers in Physiology
54
- 10.1002/biot.201100159
- Nov 29, 2011
- Biotechnology Journal
1361
- 10.1073/pnas.0610772104
- Feb 6, 2007
- Proceedings of the National Academy of Sciences
195
- 10.1002/bit.24445
- Jan 31, 2012
- Biotechnology and Bioengineering
111
- 10.1186/1752-0509-7-36
- Apr 30, 2013
- BMC Systems Biology
- Research Article
24
- 10.1016/j.ymben.2021.03.013
- Mar 24, 2021
- Metabolic Engineering
Systematic evaluation of parameters for genome-scale metabolic models of cultured mammalian cells
- Preprint Article
3
- 10.1101/2020.06.24.169938
- Jun 26, 2020
Abstract Genome-scale metabolic models describe cellular metabolism with mechanistic detail. Given their high complexity, such models need to be parameterized correctly to yield accurate predictions and avoid overfitting. Effective parameterization has been well-studied for microbial models, but it remains unclear for higher eukaryotes, including mammalian cells. To address this, we enumerated model parameters that describe key features of cultured mammalian cells – including cellular composition, bioprocess performance metrics, mammalian-specific pathways, and biological assumptions behind model formulation approaches. We tested these parameters by building thousands of metabolic models and evaluating their ability to predict the growth rates of a panel of phenotypically diverse Chinese Hamster Ovary cell clones. We found the following considerations to be most critical for accurate parameterization: (1) cells limit metabolic activity to maintain homeostasis, (2) cell morphology and viability change dynamically during a growth curve, and (3) cellular biomass has a particular macromolecular composition. Depending on parameterization, models predicted different metabolic phenotypes, including contrasting mechanisms of nutrient utilization and energy generation, leading to varying accuracies of growth rate predictions. Notably, accurate parameter values broadly agreed with experimental measurements. These insights will guide future investigations of mammalian metabolism.
- Research Article
151
- 10.1186/1471-2105-7-512
- Nov 23, 2006
- BMC Bioinformatics
BackgroundThe availability of genome sequences for many organisms enabled the reconstruction of several genome-scale metabolic network models. Currently, significant efforts are put into the automated reconstruction of such models. For this, several computational tools have been developed that particularly assist in identifying and compiling the organism-specific lists of metabolic reactions. In contrast, the last step of the model reconstruction process, which is the definition of the thermodynamic constraints in terms of reaction directionalities, still needs to be done manually. No computational method exists that allows for an automated and systematic assignment of reaction directions in genome-scale models.ResultsWe present an algorithm that – based on thermodynamics, network topology and heuristic rules – automatically assigns reaction directions in metabolic models such that the reaction network is thermodynamically feasible with respect to the production of energy equivalents. It first exploits all available experimentally derived Gibbs energies of formation to identify irreversible reactions. As these thermodynamic data are not available for all metabolites, in a next step, further reaction directions are assigned on the basis of network topology considerations and thermodynamics-based heuristic rules. Briefly, the algorithm identifies reaction subsets from the metabolic network that are able to convert low-energy co-substrates into their high-energy counterparts and thus net produce energy. Our algorithm aims at disabling such thermodynamically infeasible cyclic operation of reaction subnetworks by assigning reaction directions based on a set of thermodynamics-derived heuristic rules. We demonstrate our algorithm on a genome-scale metabolic model of E. coli. The introduced systematic direction assignment yielded 130 irreversible reactions (out of 920 total reactions), which corresponds to about 70% of all irreversible reactions that are required to disable thermodynamically infeasible energy production.ConclusionAlthough not being fully comprehensive, our algorithm for systematic reaction direction assignment could define a significant number of irreversible reactions automatically with low computational effort. We envision that the presented algorithm is a valuable part of a computational framework that assists the automated reconstruction of genome-scale metabolic models.
- Research Article
9
- 10.1007/s43393-022-00115-6
- Jul 21, 2022
- Systems Microbiology and Biomanufacturing
Due to the increasing demand for microbially manufactured products in various industries, it has become important to find optimal designs for microbial cell factories by changing the direction of metabolic flow and its flux size by means of metabolic engineering such as knocking out competing pathways and introducing exogenous pathways to increase the yield of desired products. Recently, with the gradual cross-fertilization between computer science and bioinformatics fields, machine learning and intelligent optimization-based approaches have received much attention in Genome-scale metabolic network models (GSMMs) based on constrained optimization methods, and many high-quality related works have been published. Therefore, this paper focuses on the advances and applications of machine learning and intelligent optimization algorithms in metabolic engineering, with special emphasis on GSMMs. Specifically, the development history of GSMMs is first reviewed. Then, the analysis methods of GSMMs based on constraint optimization are presented. Next, this paper mainly reviews the development and application of machine learning and intelligent optimization algorithms in genome-scale metabolic models. In addition, the research gaps and future research potential in machine learning and intelligent optimization methods applied in GSMMs are discussed.
- Research Article
33
- 10.1016/j.ymben.2021.06.005
- Jun 24, 2021
- Metabolic Engineering
Integrating thermodynamic and enzymatic constraints into genome-scale metabolic models
- Research Article
2
- 10.1360/tb-2020-1468
- Feb 3, 2021
- Chinese Science Bulletin
The genome-scale metabolic network model (GEM) is a mathematical framework based on gene-protein-reaction associations combined with stoichiometric balance and is capable of facilitating the computation and prediction of multiscale phenotypes by optimizing the objective function of interest. It has been increasingly used as an important tool for understanding cellular metabolism and characterizing cell phenotypes. In cells, metabolism is tightly controlled by intricate regulatory mechanisms at the different system levels and is strictly regulated to ensure the dynamic adaptation of biochemical reaction fluxes for maintaining cell homeostasis to ultimately achieve optimal metabolic fitness. Advances in high-throughput screening and analysis technologies have generated massive amounts of genome sequences, along with transcriptomic, proteomic and metabolomic data, providing quantitative regulatory information to gain insights into cellular metabolism; however, integrating the available omics data into constraint-based metabolic models and quantitatively profiling genotype-phenotype relationships remains an outstanding challenge for computational biology. Here, we describe the recent developments in introducing macromolecular expression into GEMs and generating metabolic expression (ME) models, which increase the complexity and predictive capability of computational frameworks. Various algorithms employ different approaches to combine additional layers of omics data to limit the cone of allowable flux distributions in the metabolic model. In this review, we categorize all methods by five different grouping criteria and evaluate their practical perspectives. The first category of methods utilizes a threshold to distinguish active and inactive states of the corresponding reactions based on the gene expression measurement data. The second uses omics data to build cell- and tissue-specific models of human metabolism by removing unexpressed reactions from the global human metabolic network. The third category of methods involves modifying reaction bounds on the basis of mRNA and protein abundance, in which the width of the “flux cone” is adjusted via the maximum possible flux in the upper bound of the FBA optimization problem dependent on gene and protein expression levels. The imposition of constraints further defines the associated solution space of the model to improve the prediction accuracy. The fourth model incorporates transcriptional regulation networks (TRNs), which describe the phenomenological interactions between different biomolecules in response to genetic and environmental perturbation, into GEMs and avoids the obstacles of information formulation to achieve comprehensive knowledge regarding the metabolic and regulatory events occurring inside the cell. The last category integrates time-series transcriptomics data with flux-based bilevel optimization to comprehend the interplay between metabolism and regulation in time-dependent processes. We compare the advantages and limitations of different categories and explore the application areas of integrated models in analyzing metabolic characteristics, interpreting phenotypic states and the consequences of environmental and genetic perturbations while discovering potential drug targets and screening anti-metabolic drugs for cancer treatment. Finally, we also highlight the future perspectives and challenges for GEM-based reconstruction with omics data integration.
- Research Article
2
- 10.1016/j.bej.2023.108947
- Apr 25, 2023
- Biochemical Engineering Journal
Reconstruction and analysis of a genome-scale metabolic model for the gut bacteria Prevotella copri
- Dissertation
- 10.18174/416473
- Jul 4, 2017
Metabolic modeling to understand and redesign microbial systems
- Research Article
42
- 10.1093/bioinformatics/bts267
- May 4, 2012
- Bioinformatics
Genome-scale metabolic network models have become an indispensable part of the increasingly important field of systems biology. Metabolic systems biology studies usually include three major components-network model construction, objective- and experiment-guided model editing and visualization, and simulation studies based mainly on flux balance analyses. Bioinformatics tools are required to facilitate these complicated analyses. Although some of the required functions have been served separately by existing tools, a free software resource that simultaneously serves the needs of the three major components is not yet available. Here we present a software platform, GEMSiRV (GEnome-scale Metabolic model Simulation, Reconstruction and Visualization), to provide functionalities of easy metabolic network drafting and editing, amenable network visualization for experimental data integration and flux balance analysis tools for simulation studies. GEMSiRV comes with downloadable, ready-to-use public-domain metabolic models, reference metabolite/reaction databases and metabolic network maps, all of which can be input into GEMSiRV as the starting materials for network construction or simulation analyses. Furthermore, all of the GEMSiRV-generated metabolic models and analysis results, including projects in progress, can be easily exchanged in the research community. GEMSiRV is a powerful integrative resource that may facilitate the development of systems biology studies. The software is freely available on the web at http://sb.nhri.org.tw/GEMSiRV.
- Research Article
- 10.1007/s10994-025-06868-0
- Jan 1, 2025
- Machine Learning
Reasoning about hypotheses and updating knowledge through empirical observations are central to scientific discovery. In this work, we applied logic-based machine learning methods to drive biological discovery by guiding experimentation. Genome-scale metabolic network models (GEMs) - comprehensive representations of metabolic genes and reactions - are widely used to evaluate genetic engineering of biological systems. However, GEMs often fail to accurately predict the behaviour of genetically engineered cells, primarily due to incomplete annotations of gene interactions. The task of learning the intricate genetic interactions within GEMs presents computational and empirical challenges. To efficiently predict using GEM, we describe a novel approach called Boolean Matrix Logic Programming (BMLP) by leveraging Boolean matrices to evaluate large logic programs. We developed a new system, BMLP_{active}, which guides cost-effective experimentation and uses interpretable logic programs to encode a state-of-the-art GEM of a model bacterial organism. Notably, BMLP_{active} successfully learned the interaction between a gene pair with fewer training examples than random experimentation, overcoming the increase in experimental design space. BMLP_{active} enables rapid optimisation of metabolic models to reliably engineer biological systems for producing useful compounds. It offers a realistic approach to creating a self-driving lab for biological discovery, which would then facilitate microbial engineering for practical applications.
- Research Article
18
- 10.1038/s41598-020-64721-x
- May 8, 2020
- Scientific Reports
Zymomonas mobilis ZM4 has recently been used for a variety of biotechnological purposes. To rationally enhance its metabolic performance, a reliable genome-scale metabolic network model (GEM) of this organism is required. To this end, we reconstructed a genome-scale metabolic model (iHN446) for Z. mobilis, which involves 446 genes, 859 reactions, and 894 metabolites. We started by first reconciling the existing GEMs previously constructed for Z. mobilis to obtain a draft network. Next, recent gene annotations, up-to-date literature, physiological data and biochemical databases were used to upgrade the network. Afterward, the draft network went through a curative and iterative process of gap-filling by computational tools and manual refinement. The final model was evaluated using experimental data and literature information. We next applied this model as a platform for analyzing the links between transcriptome-flux and transcriptome-metabolome. We found that experimental observations were in agreement with the predicted results from our final GEM. Taken together, this comprehensive model (iHN446) can be utilized for studying metabolism in Z. mobilis and finding rational targets for metabolic engineering applications.
- Research Article
83
- 10.1016/j.ymben.2015.08.006
- Sep 8, 2015
- Metabolic Engineering
13C metabolic flux analysis at a genome-scale
- Research Article
240
- 10.1093/bioinformatics/btq183
- Jun 1, 2010
- Bioinformatics
Motivation: The availability of modern sequencing techniques has led to a rapid increase in the amount of reconstructed metabolic networks. Using these models as a platform for the analysis of high throughput transcriptomic, proteomic and metabolomic data can provide valuable insight into conditional changes in the metabolic activity of an organism. While transcriptomics and proteomics provide important insights into the hierarchical regulation of metabolic flux, metabolomics shed light on the actual enzyme activity through metabolic regulation and mass action effects. Here we introduce a new method, termed integrative omics-metabolic analysis (IOMA) that quantitatively integrates proteomic and metabolomic data with genome-scale metabolic models, to more accurately predict metabolic flux distributions. The method is formulated as a quadratic programming (QP) problem that seeks a steady-state flux distribution in which flux through reactions with measured proteomic and metabolomic data, is as consistent as possible with kinetically derived flux estimations.Results: IOMA is shown to successfully predict the metabolic state of human erythrocytes (compared to kinetic model simulations), showing a significant advantage over the commonly used methods flux balance analysis and minimization of metabolic adjustment. Thereafter, IOMA is shown to correctly predict metabolic fluxes in Escherichia coli under different gene knockouts for which both metabolomic and proteomic data is available, achieving higher prediction accuracy over the extant methods. Considering the lack of high-throughput flux measurements, while high-throughput metabolomic and proteomic data are becoming readily available, we expect IOMA to significantly contribute to future research of cellular metabolism.Contacts: kerenyiz@post.tau.ac.il; tomersh@cs.technion.ac.il
- Research Article
19
- 10.1007/s10529-013-1328-x
- Sep 28, 2013
- Biotechnology Letters
Elementary modes (EMs) are steady-state metabolic flux vectors with minimal set of active reactions. Each EM corresponds to a metabolic pathway. Therefore, studying EMs is helpful for analyzing the production of biotechnologically important metabolites. However, memory requirements for computing EMs may hamper their applicability as, in most genome-scale metabolic models, no EM can be computed due to running out of memory. In this study, we present a method for computing randomly sampled EMs. In this approach, a network reduction algorithm is used for EM computation, which is based on flux balance-based methods. We show that this approach can be used to recover the EMs in the medium- and genome-scale metabolic network models, while the EMs are sampled in an unbiased way. The applicability of such results is shown by computing “estimated” control-effective flux values in Escherichia coli metabolic network.
- Research Article
- 10.13345/j.cjb.210335
- Feb 25, 2022
- Sheng wu gong cheng xue bao = Chinese journal of biotechnology
Constraint-based genome-scale metabolic network models (genome-scale metabolic models, GEMs) have been widely used to predict metabolic phenotypes. In addition to stoichiometric constraints, other constraints such as enzyme availability and thermodynamic feasibility may also limit the cellular phenotype solution space. Recently, extended GEM models considering either enzymatic or thermodynamic constraints have been developed to improve model prediction accuracy. This review summarizes the recent progresses on metabolic models with multiple constraints (MCGEMs). We presented the construction methods and various applications of MCGEMs including the simulation of gene knockout, prediction of biologically feasible pathways and identification of bottleneck steps. By integrating multiple constraints in a consistent modeling framework, MCGEMs can predict the metabolic bottlenecks and key controlling and modification targets for pathway optimization more precisely, and thus may provide more reliable design results to guide metabolic engineering of industrially important microorganisms.
- Research Article
90
- 10.1093/nar/gkab815
- Oct 6, 2021
- Nucleic Acids Research
Metagenomic analyses of microbial communities have revealed a large degree of interspecies and intraspecies genetic diversity through the reconstruction of metagenome assembled genomes (MAGs). Yet, metabolic modeling efforts mainly rely on reference genomes as the starting point for reconstruction and simulation of genome scale metabolic models (GEMs), neglecting the immense intra- and inter-species diversity present in microbial communities. Here, we present metaGEM (https://github.com/franciscozorrilla/metaGEM), an end-to-end pipeline enabling metabolic modeling of multi-species communities directly from metagenomes. The pipeline automates all steps from the extraction of context-specific prokaryotic GEMs from MAGs to community level flux balance analysis (FBA) simulations. To demonstrate the capabilities of metaGEM, we analyzed 483 samples spanning lab culture, human gut, plant-associated, soil, and ocean metagenomes, reconstructing over 14,000 GEMs. We show that GEMs reconstructed from metagenomes have fully represented metabolism comparable to isolated genomes. We demonstrate that metagenomic GEMs capture intraspecies metabolic diversity and identify potential differences in the progression of type 2 diabetes at the level of gut bacterial metabolic exchanges. Overall, metaGEM enables FBA-ready metabolic model reconstruction directly from metagenomes, provides a resource of metabolic models, and showcases community-level modeling of microbiomes associated with disease conditions allowing generation of mechanistic hypotheses.
- Research Article
54
- 10.1002/biot.201100159
- Nov 29, 2011
- Biotechnology Journal
Systems biology has greatly contributed toward the analysis and understanding of biological systems under various genotypic and environmental conditions on a much larger scale than ever before. One of the applications of systems biology can be seen in unraveling and understanding complicated human diseases where the primary causes for a disease are often not clear. The in silico genome-scale metabolic network models can be employed for the analysis of diseases and for the discovery of novel drug targets suitable for treating the disease. Also, new antimicrobial targets can be discovered by analyzing, at the systems level, the genome-scale metabolic network of pathogenic microorganisms. Such applications are possible as these genome-scale metabolic network models contain extensive stoichiometric relationships among the metabolites constituting the organism's metabolism and information on the associated biophysical constraints. In this review, we highlight applications of genome-scale metabolic network modeling and simulations in predicting drug targets and designing potential strategies in combating pathogenic infection. Also, the use of metabolic network models in the systematic analysis of several human diseases is examined. Other computational and experimental approaches are discussed to complement the use of metabolic network models in the analysis of biological systems and to facilitate the drug discovery pipeline.
- Research Article
- 10.37532/2048-9145.2021.9(1).8-9
- May 6, 2021
- Pharmaceutical bioprocessing
- Research Article
- 10.37532/2048-9145.2021.9(1).e001
- May 4, 2021
- Pharmaceutical bioprocessing
- Research Article
- 10.37532/2048-9145.2021.9(1).2-3
- Feb 19, 2021
- Pharmaceutical bioprocessing
- Research Article
- 10.37532/2048-9145.2021.9(1).6-7
- Feb 19, 2021
- Pharmaceutical bioprocessing
- Research Article
- 10.37532/2048-9145.2021.9(1).4-5
- Feb 19, 2021
- Pharmaceutical bioprocessing
- Research Article
- 10.37532/2048-9145.2021.9(3).24-25
- Jan 1, 2021
- Pharmaceutical bioprocessing
- Research Article
- 10.37532/2048-9145.2021.9(3).22-23
- Jan 1, 2021
- Pharmaceutical bioprocessing
- Research Article
- 10.37532/2048-9145.2021.9(3).e28
- Jan 1, 2021
- Pharmaceutical bioprocessing
- Research Article
- 10.37532/2048-9145.2021.9(3).20-21
- Jan 1, 2021
- Pharmaceutical bioprocessing
- Research Article
- 10.37532/2048-9145.2021.9(3).26-27
- Jan 1, 2021
- Pharmaceutical bioprocessing
- Ask R Discovery
- Chat PDF
AI summaries and top papers from 250M+ research sources.