Kernel approaches for differential expression analysis of mass spectrometry-based metabolomics data.

Xiang Zhan, Debashis Ghosh, Andrew D Patterson

doi:10.1186/s12859-015-0506-3

Abstract

Background: Data generated from metabolomics experiments are different from other types of “-omics” data. For example, a common phenomenon in mass spectrometry (MS)-based metabolomics data is that the data matrix frequently contains missing values, which complicates some quantitative analyses. One way to tackle this problem is to treat them as absent. Hence there are two types of information that are available in metabolomics data: presence/absence of a metabolite and a quantitative value of the abundance level of a metabolite if it is present. Combining these two layers of information poses challenges to the application of traditional statistical approaches in differential expression analysis.

Results: In this article, we propose a novel kernel-based score test for the metabolomics differential expression analysis. In order to simultaneously capture both the continuous pattern and discrete pattern in metabolomics data, two new kinds of kernels are designed. One is the distance-based kernel and the other is the stratified kernel. While we initially describe the procedures in the case of single-metabolite analysis, we extend the methods to handle metabolite sets as well.

Conclusions: Evaluation based on both simulated data and real data from a liver cancer metabolomics study indicates that our kernel method has a better performance than some existing alternatives. An implementation of the proposed kernel method in the R statistical computing environment is available at http://works.bepress.com/debashis_ghosh/60/.

Electronic supplementary material: The online version of this article (doi:10.1186/s12859-015-0506-3) contains supplementary material, which is available to authorized users.

Highlights

Data generated from metabolomics experiments are different from other types of “-omics” data
Mass spectrometry (MS)-based metabolomics data are typically characterized by high dimensionality, small sample size, high correlation among metabolites, redundant information, and especially a sparse data matrix, which is composed of the samples, the variable ID (m/z, retention time), and the peak area
We focus on differential expression analysis of mass spectrometry (MS)-based metabolomics data

Summary

Introduction

Data generated from metabolomics experiments are different from other types of “-omics” data. Two types of information are available in metabolomics data: the presence/absence of a metabolite and, if it is present, a quantitative value of its abundance level. Combining these two layers of information poses challenges to the application of traditional statistical approaches in differential expression analysis. We focus on differential expression analysis of MS-based metabolomics data. The fundamental approach is to compare the abundance level of a metabolite between an experimental group and a control group, and to use statistics to assess the significance of any differences. This kind of study strongly supports the value of proper identification of putative oncometabolomic markers [5,6].
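
To make the two layers of information concrete, the following is a minimal Python sketch, not the authors' R implementation. It assumes missing values are coded as NaN and treated as absent. The similarity function `toy_kernel`, its `bandwidth` parameter, the quadratic-form statistic, and the permutation p-value are all illustrative assumptions; the distance-based and stratified kernels proposed in the paper are not specified in this section and may differ.

```python
import numpy as np

def split_layers(x):
    """Split one metabolite's intensities into the two layers of information."""
    x = np.asarray(x, dtype=float)
    present = ~np.isnan(x)                 # discrete layer: presence/absence
    abundance = np.where(present, x, 0.0)  # continuous layer; 0.0 is only a placeholder when absent
    return present, abundance

def toy_kernel(x, bandwidth=1.0):
    """Hypothetical similarity matrix (an assumption, not the paper's kernel):
    1 if both samples are absent, Gaussian in abundance if both are present, 0 otherwise."""
    present, abundance = split_layers(x)
    both_present = np.outer(present, present)
    both_absent = np.outer(~present, ~present)
    diff = abundance[:, None] - abundance[None, :]
    gauss = np.exp(-(diff ** 2) / (2.0 * bandwidth ** 2))
    return np.where(both_present, gauss, 0.0) + both_absent.astype(float)

def kernel_score_stat(K, y):
    """Quadratic form (y - mean(y))' K (y - mean(y)) for group labels y."""
    r = y - y.mean()
    return float(r @ K @ r)

def permutation_pvalue(x, y, n_perm=2000, seed=0):
    """Compare the observed statistic with its distribution under shuffled group labels."""
    rng = np.random.default_rng(seed)
    y = np.asarray(y, dtype=float)
    K = toy_kernel(x)
    observed = kernel_score_stat(K, y)
    perm = np.array([kernel_score_stat(K, rng.permutation(y)) for _ in range(n_perm)])
    p_value = (1 + np.sum(perm >= observed)) / (n_perm + 1)
    return observed, float(p_value)

# Toy data: the metabolite is absent in all controls (y = 0) and present in all cases (y = 1).
x = [np.nan, np.nan, np.nan, np.nan, 5.1, 4.8, 5.3, 5.0]
y = [0, 0, 0, 0, 1, 1, 1, 1]
present, abundance = split_layers(x)
K = toy_kernel(x)
observed, p_value = permutation_pvalue(x, y)
```

Here the group difference lives entirely in the presence/absence layer, which a comparison of abundances among present values alone could not detect. A similarity measure that uses both layers can still register it.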

Journal: BMC bioinformatics
Publication Date: Mar 11, 2015
Citations: 27
License type: CC BY 4.0
