Abstract
Automatic reconstruction of metabolic pathways for an organism from genomics and transcriptomics data has been a challenging and important problem in bioinformatics. Traditionally, known reference pathways can be mapped into an organism-specific ones based on its genome annotation and protein homology. However, this simple knowledge-based mapping method might produce incomplete pathways and generally cannot predict unknown new relations and reactions. In contrast, ab initio metabolic network construction methods can predict novel reactions and interactions, but its accuracy tends to be low leading to a lot of false positives.Here we combine existing pathway knowledge and a new ab initio Bayesian probabilistic graphical model together in a novel fashion to improve automatic reconstruction of metabolic networks. Specifically, we built a knowledge database containing known, individual gene / protein interactions and metabolic reactions extracted from existing reference pathways. Known reactions and interactions were then used as constraints for Bayesian network learning methods to predict metabolic pathways. Using individual reactions and interactions extracted from different pathways of many organisms to guide pathway construction is new and improves both the coverage and accuracy of metabolic pathway construction. We applied this probabilistic knowledge-based approach to construct the metabolic networks from yeast gene expression data and compared its results with 62 known metabolic networks in the KEGG database. The experiment showed that the method improved the coverage of metabolic network construction over the traditional reference pathway mapping method and was more accurate than pure ab initio methods.
Highlights
A metabolic pathway is a network of related chemical reactions catalyzed by enzymes that collaboratively produce or degrade one or a few metabolites
Experiment design In order to evaluate the pathways predicted by the computation methods, we collected a list of known yeast metabolic pathways in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database as presumably true pathways
The results of this study indicate that the knowledge-based network inference approach was effective in predicting yeast metabolic pathways
Summary
A metabolic pathway is a network of related chemical reactions catalyzed by enzymes that collaboratively produce or degrade one or a few metabolites. Reconstruction of metabolic networks (pathways) plays an important role in studying biological systems. Together with other types of biological networks, metabolic pathways can help decipher relationships between genotype and phenotype, and elucidate essential mechanisms underlying cellular physiology [1]. Most known metabolic pathways stored in the pathway databases such as the Kyoto Encyclopedia of Genes and Genomes (KEGG) [2,3] have been manually curated from the literature. The high-quality manual annotations of metabolic pathways are valuable resources for studying metabolisms, but they only account for a small portion of pathways in most organisms. Automatic computational reconstruction of metabolic pathways has been an important problem to solve in bioinformatics and computational biology.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.