A data preprocessing strategy for metabolomics to reduce the mask effect in data analysis.

Jun Yang, Guowang Xu, Xiaohui Lin, Xinjie Zhao, Xin Lu

doi:10.3389/fmolb.2015.00004

Abstract

Highlights: Developed a data preprocessing strategy to cope with missing values and with mask effects in data analysis caused by the high variation of abundant metabolites. A new method, ‘x-VAST’, was developed to amend the measurement deviation enlargement. Applying the above strategy, several low-abundance masked differential metabolites were rescued.

Metabolomics is a booming research field. Its success relies heavily on the discovery of differential metabolites by comparing different data sets (for example, patients vs. controls). One of the challenges is that between-group differences in low-abundance metabolites are often masked by the high variation of abundant metabolites. To address this challenge, a novel data preprocessing strategy consisting of three steps was proposed in this study. In step 1, a ‘modified 80%’ rule was used to reduce the effect of missing values. In step 2, unit-variance and Pareto scaling methods were used to reduce the mask effect from the abundant metabolites. In step 3, to fix the adverse effect of scaling, stability information of the variables, deduced from intensity information and class information, was used to assign suitable weights to the variables. When applied to an LC/MS-based metabolomics dataset from a study of chronic hepatitis B patients and to two simulated datasets, the strategy partially eliminated the mask effect and rescued several new low-abundance differential metabolites.
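The three steps can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract gives no formulas, so the ‘modified 80%’ rule is assumed here to mean "keep a variable if it is detected in at least 80% of the samples of at least one class", missing values are assumed to be encoded as 0 or NaN, and, because x-VAST is not defined in this text, the classic VAST weight (mean divided by standard deviation) stands in for the stability weighting of step 3. The function names and the toy matrix are hypothetical.

```python
import numpy as np

def modified_80_rule(X, y, threshold=0.8):
    """Return a boolean mask of variables to keep.

    Assumption: a variable is kept if it is detected (non-zero, non-NaN)
    in at least `threshold` of the samples of at least one class.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    present = (X != 0) & ~np.isnan(X)
    keep = np.zeros(X.shape[1], dtype=bool)
    for label in np.unique(y):
        detected_fraction = present[y == label].mean(axis=0)
        keep |= detected_fraction >= threshold
    return keep

def scale(X, method="uv"):
    """Column-wise mean-centering followed by UV, Pareto, or classic VAST scaling."""
    X = np.asarray(X, dtype=float)
    mean = X.mean(axis=0)
    sd = X.std(axis=0, ddof=1)
    sd = np.where(sd == 0, 1.0, sd)  # constant columns stay at zero after centering
    centered = X - mean
    if method == "uv":
        return centered / sd
    if method == "pareto":
        return centered / np.sqrt(sd)
    if method == "vast":
        # Classic VAST: UV scaling weighted by mean/sd, used here as a
        # stand-in for the stability weighting; this is NOT x-VAST.
        return (centered / sd) * (mean / sd)
    raise ValueError(f"unknown scaling method: {method}")

# Toy data (hypothetical): rows are samples, columns are metabolites.
# Column 0: abundant, high variation, no class difference.
# Column 1: low abundance, clear class difference.
# Column 2: detected in one sample only.
# Column 3: detected in every sample of class 1, never in class 0.
y = np.array([0, 0, 0, 1, 1, 1])
X = np.array([
    [1000.0, 1.0, 0.0, 0.0],
    [1400.0, 1.2, 0.0, 0.0],
    [600.0, 0.8, 5.0, 0.0],
    [1200.0, 2.0, 0.0, 3.0],
    [800.0, 2.2, 0.0, 4.0],
    [1000.0, 1.8, 0.0, 5.0],
])

keep = modified_80_rule(X, y)
X_kept = X[:, keep]
X_uv = scale(X_kept, "uv")
X_pareto = scale(X_kept, "pareto")
X_vast = scale(X_kept, "vast")
print("kept variables:", keep)
```

Under this reading of the rule, column 3 survives because it is fully detected in one class, whereas a class-blind 80% threshold would discard it at 50% overall detection. After UV scaling every retained column has unit variance, so the abundant column 0 no longer dominates the low-abundance column 1.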

Highlights

We have developed a novel data preprocessing strategy to cope with missing values and to eliminate mask effects in data analysis caused by the high variation of abundant metabolites.
Plasma samples and high performance liquid chromatography-mass spectrometry (HPLC-MS) analysis: Thirty-seven chronic hepatitis B patients hospitalized for acute deterioration in liver function and 50 healthy individuals were enrolled in this study.
Data preprocessing is a critical step in information mining in metabolomics studies; it directly influences the discovery of differential biomarkers.

Summary

Introduction

Metabolomics has been successfully applied in many fields, including clinical research (Brindle et al., 2002; Yang et al., 2004, 2005; Abate-Shen and Shen, 2009; Sreekumar et al., 2009), drug discovery (Kell and Goodacre, 2014), toxicology (Keun, 2006; van Ravenzwaay et al., 2014), and phytochemistry (Fiehn, 2002; Mari et al., 2013). A general strategy of data (pre-)processing and validation for human metabolomics studies was given by Bijlsma et al. (2006). However, they did not describe how the data preprocessing method affects the results or which data preprocessing methods should be selected for a given study.



Journal: Frontiers in Molecular Biosciences	Publication Date: Feb 2, 2015
Citations: 88	License type: cc-by


Similar Papers

Enhanced flux prediction by integrating relative expression and relative metabolite abundance into thermodynamically consistent metabolic models.
Vikash Pandey ... Noushin Hadadi
PLOS Computational Biology | VOL. 15 | 13 May 2019

Blood metabolic and physiological profiles of Bama miniature pigs at different growth stages
Jiayuan Mo ... Jing Liang
Porcine Health Management | VOL. 8 | 08 Aug 2022

Metabolomics analysis shows the differences in metabolites in deer antler bases of red deer and sika deer
Zhenxiang Zhang ... Zhaonan Li
Animal Production Science | VOL. 63 | 01 Jan 2023

Short-term continuous cropping leads to a decline in rhizosphere soil fertility by modulating the perilla root exudates
Yaqi Liu ... Fuqiang Song
Rhizosphere | VOL. 32 | 01 Dec 2024
