Abstract

The identification of compounds in complex mixtures remains challenging despite recent advances in analytical techniques. At present, no single method can detect and quantify the vast array of compounds that might be of potential interest in metabolomics studies. High performance liquid chromatography/mass spectrometry (HPLC/MS) is often considered the analytical method of choice for analysis of biofluids. The positive identification of an unknown involves matching at least two orthogonal HPLC/MS measurements (exact mass, retention index, drift time etc.) against an authentic standard. However, due to the limited availability of authentic standards, an alternative approach involves matching known and measured features of the unknown compound with computationally predicted features for a set of candidate compounds downloaded from a chemical database. Computationally predicted features include retention index, ECOM50 (energy required to decompose 50% of a selected precursor ion in a collision induced dissociation cell), drift time, whether the unknown compound is biological or synthetic and a collision induced dissociation (CID) spectrum. Computational predictions are used to filter the initial “bin” of candidate compounds. The final output is a ranked list of candidates that best match the known and measured features. In this mini review, we discuss cheminformatics methods underlying this database search-filter identification approach.

Highlights

  • Metabolomics focuses on the study of small molecules produced by metabolic processes within a cell

  • The chemical space of non- chosen for High performance liquid chromatography/mass spectrometry (HPLC/MS) based Quantitative structure property relationship (QSPR) modeling might include HPLC

  • Current retention index (RI) and ECOM50 models allow for the removal of 28% of compounds from PubChem bins [8]

Read more

Summary

Introduction

Metabolomics focuses on the study of small molecules The physiochemical properties molecular weight of 2,250 Da. The chemical space of non- chosen for HPLC/MS based QSPR modeling might include HPLC metabolites was represented by a random set of commercially available compounds in the mass range 17 – 1,006 Da (average 374 ± 95 Da) from the ZINC [24] chemical database. The chemical space of non- chosen for HPLC/MS based QSPR modeling might include HPLC metabolites was represented by a random set of commercially available compounds in the mass range 17 – 1,006 Da (average 374 ± 95 Da) from the ZINC [24] chemical database Both chemical spaces were retention index (RI), ECOM50 and drift time. The authors were able to use the protonated structure based Molconn

ECOM50
Findings
Summary and Outlook
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.