NMR Data Sets Research Articles

Metabolomics commonly relies on using one-dimensional (1D) 1H NMR spectroscopy or liquid chromatography-mass spectrometry (LC-MS) to derive scientific insights from large collections of biological samples. NMR and MS approaches to metabolomics require, among other issues, a data processing pipeline. Quantitative assessment of the performance of these software platforms is challenged by a lack of standardized data sets with "known" outcomes. To resolve this issue, we created a novel simulated LC-MS data set with known peak locations and intensities, defined metabolite differences between groups (i.e., fold change > 2, coefficient of variation ≤ 25%), and different amounts of added Gaussian noise (0, 5, or 10%) and missing features (0, 10, or 20%). This data set was developed to improve benchmarking of existing LC-MS metabolomics software and to validate the updated version of our MVAPACK software, which added gas chromatography-MS and LC-MS functionality to its existing 1D and two-dimensional NMR data processing capabilities. We also included two experimental LC-MS data sets acquired from a standard mixture andMycobacterium smegmatiscell lysates since a simulated data set alone may not capture all the unique characteristics and variability of real spectra needed to assess software performance properly. Our simulated and experimental LC-MS data sets were processed with the MS-DIAL and XCMSOnline software packages and our MVAPACK toolkit to showcase the utility of our data sets to benchmark MVAPACK against community standards. Our results demonstrate the enhanced objectivity and clarity of software assessment that can be achieved when both simulated and experimental data are employed since distinctly different software performances were observed with the simulated and experimental LC-MS data sets. We also demonstrate that the performance of MVAPACK is equivalent to or exceeds existing LC-MS software programs while providing a single platform for processing and analyzing both NMR and MS data sets.

Read full abstract

This article presents a novel method for quantifying water saturation in oil-sand reservoirs by employing 1D low-field nuclear magnetic resonance (LF-NMR) spin-spin relaxation and bulk density measurements as indicators of pore volume variations. One of the challenges in accurately determining the volumes of bitumen and water in oil-sands is the effective separation of their overlapping T2 signals, attributed to similar spin-spin relaxation decay times and diffusive coupling. Conventional methods require deconvolution of T2 peaks and or experimentation to determine T2 cutoff values, differentiating between bitumen and water signals, notably capillary and clay-bound water. In contrast, our approach predicts the proportion of water by utilizing matrix decomposition methods to compress the T2 relaxation distribution and extract significant components. These components subsequently train the regression model, facilitating the accurate estimation of relative water saturation percentages.The NMR dataset was obtained by benchtop LF-NMR T2 measurements from 82 oil-sand samples, with preserved bitumen and water saturations at both reservoir and ambient temperatures (6 °C and 25 °C), yielding 164 observations. We examined four matrix decomposition methods, including principal component analysis, its variation integrating a kernel function, canonical correlation analysis, and partial least squares regression. X-ray CT measurements and Dean-Stark extraction ascertained the respective sample bulk densities and fluid-solid volume proportions.The PCA model prediction statistics (RMSE = 0.86%, R2 = 0.84), indicate its application can be extended for saturation prediction from NMR and bulk density well logs. Moreover, we underscore the importance of incorporating bulk density measurements and establish the statistical and physical correlations between these measurements and NMR T2 relaxation, providing insights into the approach's efficacy and causality.

Read full abstract

NMR Data Sets Research Articles

Related Topics

Articles published on NMR Data Sets

Simulated LC-MS Data Set for Assessing the Metabolomics Data Processing Pipeline Implemented into MVAPACK.

Discovery of olimycin E from Streptomyces sp. 11695.

Brewing alcohol 101: An undergraduate experiment utilizing benchtop NMR for quantification and process monitoring.

The 100-protein NMR spectra dataset: A resource for biomolecular NMR data analysis

Matrix decomposition methods for accurate water saturation prediction in Canadian oil-sands by LF-NMR T2 measurements

Chemical profiling of botanical extracts obtained in NADES systems using centrifugal partition chromatography combined with 13 C NMR dereplication-Hypericum perforatum as a case study.

Synergistic Combination of NAPROC-13 and NMR 13C DFT Calculations: A Powerful Approach for Revising the Structure of Natural Products.

Rapid Chemical Profiling of Filipendula ulmaria Using CPC Fractionation, 2-D Mapping of 13C NMR Data, and High-Resolution LC-MS.

Hosimosines A-E, structurally diverse cytisine derivatives from the seeds of Ormosia hosiei Hemsl. et Wils

Integrated NMR/Molecular Dynamics Determination of the Ensemble Conformation of a Thermodynamically Stable CUUG RNA Tetraloop.

Blind assessment of monomeric AlphaFold2 protein structure models with experimental NMR data

Study of coffee sensory attributes by ordered predictors selection applied to 1H NMR spectroscopy

Determining sequential micellization steps of bile salts with multi-CMC modeling

DELTA50: A Highly Accurate Database of Experimental 1H and 13C NMR Chemical Shifts Applied to DFT Benchmarking

Modelling solution structures of cyclic peptides - How good are we?

DD-ComDim: A data-driven multiblock approach for one-class classifiers

Antimicrobial Polyketides from the Marine-Derived Fungus Spiromastix sp. SCSIO F190.

Application of Machine Learning Solutions to Optimize Parameter Prediction to Enhance Automatic NMR Metabolite Profiling.

Modular Pulse Program Generation for NMR Supersequences.

Metabolomic Profiling of Malaysian and New Zealand Honey Using Concatenated NMR and HRMS Datasets.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

NMR Data Sets Research Articles

Related Topics

Articles published on NMR Data Sets

Simulated LC-MS Data Set for Assessing the Metabolomics Data Processing Pipeline Implemented into MVAPACK.

Discovery of olimycin E from Streptomyces sp. 11695.

Brewing alcohol 101: An undergraduate experiment utilizing benchtop NMR for quantification and process monitoring.

The 100-protein NMR spectra dataset: A resource for biomolecular NMR data analysis

Matrix decomposition methods for accurate water saturation prediction in Canadian oil-sands by LF-NMR T2 measurements

Chemical profiling of botanical extracts obtained in NADES systems using centrifugal partition chromatography combined with 13 C NMR dereplication-Hypericum perforatum as a case study.

Synergistic Combination of NAPROC-13 and NMR 13C DFT Calculations: A Powerful Approach for Revising the Structure of Natural Products.

Rapid Chemical Profiling of Filipendula ulmaria Using CPC Fractionation, 2-D Mapping of 13C NMR Data, and High-Resolution LC-MS.

Hosimosines A-E, structurally diverse cytisine derivatives from the seeds of Ormosia hosiei Hemsl. et Wils

Integrated NMR/Molecular Dynamics Determination of the Ensemble Conformation of a Thermodynamically Stable CUUG RNA Tetraloop.

Blind assessment of monomeric AlphaFold2 protein structure models with experimental NMR data

Study of coffee sensory attributes by ordered predictors selection applied to 1H NMR spectroscopy

Determining sequential micellization steps of bile salts with multi-CMC modeling

DELTA50: A Highly Accurate Database of Experimental 1H and 13C NMR Chemical Shifts Applied to DFT Benchmarking

Modelling solution structures of cyclic peptides - How good are we?

DD-ComDim: A data-driven multiblock approach for one-class classifiers

Antimicrobial Polyketides from the Marine-Derived Fungus Spiromastix sp. SCSIO F190.

Application of Machine Learning Solutions to Optimize Parameter Prediction to Enhance Automatic NMR Metabolite Profiling.

Modular Pulse Program Generation for NMR Supersequences.

Metabolomic Profiling of Malaysian and New Zealand Honey Using Concatenated NMR and HRMS Datasets.