Abstract

BackgroundThe analysis of LC-MS metabolomic datasets appears to be a challenging task in a wide range of disciplines since it demands the highly extensive processing of a vast amount of data. Different LC-MS data analysis packages have been developed in the last few years to facilitate this analysis. However, most of these strategies involve chromatographic alignment and peak shaping and often associate each “feature” (i.e., chromatographic peak) with a unique m/z measurement. Thus, the development of an alternative data analysis strategy that is applicable to most types of MS datasets and properly addresses these issues is still a challenge in the metabolomics field.ResultsHere, we present an alternative approach called ROIMCR to: i) filter and compress massive LC-MS datasets while transforming their original structure into a data matrix of features without losing relevant information through the search of regions of interest (ROIs) in the m/z domain and ii) resolve compressed data to identify their contributing pure components without previous alignment or peak shaping by applying a Multivariate Curve Resolution-Alternating Least Squares (MCR-ALS) analysis. In this study, the basics of the ROIMCR method are presented in detail and a detailed description of its implementation is also provided. Data were analyzed using the MATLAB (The MathWorks, Inc., www.mathworks.com) programming and computing environment. The application of the ROIMCR methodology is described in detail, with an example of LC-MS data generated in a lipidomic study and with other examples of recent applications.ConclusionsThe methodology presented here combines the benefits of data filtering and compression based on the searching of ROI features, without the loss of spectral accuracy. The method has the benefits of the application of the powerful MCR-ALS data resolution method without the necessity of performing chromatographic peak alignment or modelling. The presented method is a powerful alternative to other existing data analysis approaches that do not use the MCR-ALS method to resolve LC-MS data. The ROIMCR method also represents an improved strategy compared to the direct applications of the MCR-ALS method that use less-powerful data compression strategies such as binning and windowing. Overall, the strategy presented here confirms the usefulness of the ROIMCR chemometrics method for analyzing LC-MS untargeted metabolomics data.

Highlights

  • The analysis of liquid chromatography coupled to mass spectrometry (LC-Mass spectrometry (MS)) metabolomic datasets appears to be a challenging task in a wide range of disciplines since it demands the highly extensive processing of a vast amount of data

  • Implementation of the ROIMCR procedure The Regions of interest (ROI) compression procedure presented in this study has been implemented as command line functions in the MATLAB environment available at http://cidtransfer.cid. csic.es/descarga.php?enlace1=298348e5b34daf9e8448353 52bafa645250ee1 and at www.mcrals.info

  • Results the dataset used as example in the present study was already used in previous studies by the authors [16, 17], the results presented here were not presented in the previous publications and are selected to show the key features of ROIMCR methodology in the present study

Read more

Summary

Introduction

The analysis of LC-MS metabolomic datasets appears to be a challenging task in a wide range of disciplines since it demands the highly extensive processing of a vast amount of data. Some of the most frequently used open-source software packages include XCMS [3, 4] (and XCMS-based Metabox [5], metaX [6]), CAMERA [7], MAIT [8], MetaboAnalyst [9], Workflow4Metabolomics [10], MZmine [11] and MetAlign [12] None of these approaches are highlighted as the best strategy, and the analysis of LC-MS data remains an unresolved problem in the bioinformatics field due to the methodological discrepancies existing among these approaches. High-resolution raw mass spectra are converted into a matrix representation by dividing the m/z axis into parts with a specific bin size that is generally set to a multiple of the mass accuracy of the mass spectrometer. When performing windowing, the whole process is more tedious and time-consuming, since one sample must be analyzed in several parts

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call