Improved batch correction in untargeted MS-based metabolomics.

Ron Wehrens,Rik Kooke,Henriëtte D L M Van Eekelen,Joost J B Keurentjes,Erik Wijnker,Pádraic J Flood,Jos A Hageman,Ric C H De Vos,Roland Mumm,Fred Van Eeuwijk,Robert D Hall,Arjen Lommen

doi:10.1007/s11306-016-1015-8

Abstract

IntroductionBatch effects in large untargeted metabolomics experiments are almost unavoidable, especially when sensitive detection techniques like mass spectrometry (MS) are employed. In order to obtain peak intensities that are comparable across all batches, corrections need to be performed. Since non-detects, i.e., signals with an intensity too low to be detected with certainty, are common in metabolomics studies, the batch correction methods need to take these into account. ObjectivesThis paper aims to compare several batch correction methods, and investigates the effect of different strategies for handling non-detects.MethodsBatch correction methods usually consist of regression models, possibly also accounting for trends within batches. To fit these models quality control samples (QCs), injected at regular intervals, can be used. Also study samples can be used, provided that the injection order is properly randomized. Normalization methods, not using information on batch labels or injection order, can correct for batch effects as well. Introducing two easy-to-use quality criteria, we assess the merits of these batch correction strategies using three large LC–MS and GC–MS data sets of samples from Arabidopsis thaliana.ResultsThe three data sets have very different characteristics, leading to clearly distinct behaviour of the batch correction strategies studied. Explicit inclusion of information on batch and injection order in general leads to very good corrections; when enough QCs are available, also general normalization approaches perform well. Several approaches are shown to be able to handle non-detects—replacing them with very small numbers such as zero seems the worst of the approaches considered.ConclusionThe use of quality control samples for batch correction leads to good results when enough QCs are available. If an experiment is properly set up, batch correction using the study samples usually leads to a similar high-quality correction, but has the advantage that more metabolites are corrected. The strategy for handling non-detects is important: choosing small values like zero can lead to suboptimal batch corrections.

Highlights

Batch effects in large untargeted metabolomics experiments are almost unavoidable, especially when sensitive detection techniques like mass spectrometry (MS) are employed
The use of quality control samples for batch correction leads to good results when enough QCs are available
If an experiment is properly set up, batch correction using the study samples usually leads to a similar high-quality correction, but has the advantage that more metabolites are corrected

Summary

Introduction

Batch effects in large untargeted metabolomics experiments are almost unavoidable, especially when sensitive detection techniques like mass spectrometry (MS) are employed. In order to obtain peak intensities that are comparable across all batches, corrections need to be performed. Since non-detects, i.e., signals with an intensity too low to be detected with certainty, are common in metabolomics studies, the batch correction methods need to take these into account. Mass spectrometry (MS) is the dominant detection technique in untargeted metabolomics experiments due to its sensitivity and information content. In many cases it allows tentative annotations of metabolites on the basis of. Samples in metabolomics studies typically consist of complex matrices containing a large number of metabolites. In particular batch-tobatch variation is commonly seen, where a batch is defined as a set of samples that have been extracted as well as measured in one uninterrupted sequence

Objectives

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Metabolomics	Publication Date: Mar 18, 2016
Citations: 208	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Improved batch correction in untargeted MS-based metabolomics.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Metabolomics

Lead the way for us

Similar Papers

Comparison of statistical methods and the use of quality control samples for batch effect correction in human transcriptome data.
Almudena Espín-Pérez ... Theo M C M De Kok
PLOS ONE | VOL. 13
Almudena Espín-Pérez, et. al.Almudena Espín-Pérez ... Theo M C M De Kok
30 Aug 2018
PLOS ONE | VOL. 13

DeepMNN: Deep Learning-Based Single-Cell RNA Sequencing Data Batch Correction Using Mutual Nearest Neighbors.
Bin Zou ... Tongda Zhang
Frontiers in Genetics | VOL. 12
Bin Zou, et. al.Bin Zou ... Tongda Zhang
10 Aug 2021
Frontiers in Genetics | VOL. 12

AMDBNorm: an approach based on distribution adjustment to eliminate batch effects of gene expression data.
Xu Zhang ... Feng Qiao
Briefings in Bioinformatics | VOL. 23
Xu Zhang, et. al.Xu Zhang ... Feng Qiao
28 Dec 2021
Briefings in Bioinformatics | VOL. 23

Abstract 1216: Combinatory technologies for single sample gene expression projection onto a cohort sequenced with a different technology for personalized clinical decision-making
Nikita Kotlov ... Elena Vasileva
Cancer Research | VOL. 82
Nikita Kotlov, et. al.Nikita Kotlov ... Elena Vasileva
15 Jun 2022
Cancer Research | VOL. 82

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improved batch correction in untargeted MS-based metabolomics.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Metabolomics