Abstract

Because of its ability to generate biological hypotheses, metabolomics offers an innovative and promising approach in many fields, including clinical research. However, collecting specimens in this setting can be difficult to standardize, especially when groups of patients with different degrees of disease severity are considered. In addition, despite major technological advances, it remains challenging to measure all the compounds defining the metabolic network of a biological system. In this context, the characterization of samples based on several analytical setups is now recognized as an efficient strategy to improve the coverage of metabolic complexity. For this purpose, chemometrics proposes efficient methods to reduce the dimensionality of these complex datasets spread over several matrices, allowing the integration of different sources or structures of metabolic information. Bioinformatics databases and query tools designed to describe and explore metabolic network models offer extremely useful solutions for the contextualization of potential biomarker subsets, enabling mechanistic hypotheses to be considered rather than simple associations. In this study, network principal component analysis was used to investigate samples collected from three cohorts of patients including multiple stages of chronic kidney disease. Metabolic profiles were measured using a combination of four analytical setups involving different separation modes in liquid chromatography coupled to high resolution mass spectrometry. Based on the chemometric model, specific patterns of metabolites, such as N-acetyl amino acids, could be associated with the different subgroups of patients. Further investigation of the metabolic signatures carried out using genome-scale network modeling confirmed both tryptophan metabolism and nucleotide interconversion as relevant pathways potentially associated with disease severity. Metabolic modules composed of chemically adjacent or close compounds of biological relevance were further investigated using carbon transfer reaction paths. Overall, the proposed integrative data analysis strategy allowed deeper insights into the metabolic routes associated with different groups of patients to be gained. Because of their complementary role in the knowledge discovery process, the association of chemometrics and bioinformatics in a common workflow is therefore shown as an efficient methodology to gain meaningful insights in a clinical context.

Highlights

  • While efforts are still being made to improve both technological and computational aspects, metabolomics is recognized as an essential approach to assess biochemical phenotypes in many application fields, including clinical research

  • 393 blood samples were collected from the predefined cohorts of patients grouped into different categories according to their renal status: 56 healthy control volunteers (CTRL), 69 chronic kidney disease (CKD) patients at intermediate stage (ICKD), 35 patients with end-stage renal disease (ESRD) undergoing HD (HD), 42 ESRD undergoing kidney graft (KG) and 24 healthy LKD (DV)

  • The association of chemometrics and bioinformatics in a common workflow was shown to be an effective approach for the integrative analysis of samples collected from several groups of patients suffering from multiple stages of CKD and/or undergoing different treatments

Read more

Summary

Introduction

While efforts are still being made to improve both technological and computational aspects, metabolomics is recognized as an essential approach to assess biochemical phenotypes in many application fields, including clinical research. Metabolomic experiments often generate large amounts of high-dimensional and complex biochemical data involving multiple signals measured from thousands of low molecular weight compounds. Dedicated strategies need to be applied to extract meaningful biological knowledge from the collected MS data (Boccard et al, 2010). The integration of data collected from different sample preparation protocols, separation principles, ionization modes or analytical platforms has been recognized as an efficient strategy to improve the metabolome coverage of complex samples, potentially offering better understanding of the underlying biological mechanisms associated with a given phenotypic pattern (Richards et al, 2010). Dedicated data mining tools accounting adequately for metabolomic signals spread over multiple data tables are needed, and chemometrics offers potent solutions for data integration based on dimensionality reduction approaches. Multivariate models able to handle multiple blocks of variables (multiblock) associated with different groups of observations (multigroup) are established as effective methods for data integration in omics disciplines (Boccard and Rudaz, 2014)

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call