Abstract

In this manuscript, we propose a novel approach to assess relationships between environment and metabolic networks. We used a comprehensive dataset of more than 5000 prokaryotic species from which we derived the metabolic networks. We compute the scope from the reconstructed graphs, which is the set of all metabolites and reactions that can potentially be synthesized when provided with external metabolites. We show using machine learning techniques that the scope is an excellent predictor of taxonomic and environmental variables, namely growth temperature, oxygen tolerance, and habitat. In the literature, metabolites and pathways are rarely used to discriminate species. We make use of the scope underlying structure—metabolites and pathways—to construct the predictive models, giving additional information on the important metabolic pathways needed to discriminate the species, which is often absent in other metabolic network properties. For example, in the particular case of growth temperature, glutathione biosynthesis pathways are specific to species growing in cold environments, whereas tungsten metabolism is specific to species in warm environments, as was hinted in current literature. From a machine learning perspective, the scope is able to reduce the dimension of our data, and can thus be considered as an interpretable graph embedding.

Highlights

  • In this manuscript, we propose a novel approach to assess relationships between environment and metabolic networks

  • Through the use of large metabolic databases such as the Kyoto Encyclopedia of Genes and Genomes (KEGG)[1], it is possible to have access to genomic, enzymatic, and metabolic information, and deduce some complex interactions happening inside the organisms

  • The metabolism of living organisms can be described with metabolic networks

Read more

Summary

Introduction

We propose a novel approach to assess relationships between environment and metabolic networks. Through the use of large metabolic databases such as the Kyoto Encyclopedia of Genes and Genomes (KEGG)[1], it is possible to have access to genomic, enzymatic, and metabolic information, and deduce some complex interactions happening inside the organisms In this context, metabolic networks have been used to study the set of all chemical reactions of organisms in a holistic manner, providing insights into its underlying structure and the different adaptations of these organisms to their environment. There is one with metabolites and enzymes as nodes, in which each substrate of a reaction will be linked to the catalysing enzyme, which is in turn linked to the products of the reaction This is the representation we used when reconstructing our metabolic networks. We instead follow the first research direction, and are only interested in topological metrics for graph comparison

Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.