Abstract

BackgroundThe identification of differential metabolites in metabolomics is still a big challenge and plays a prominent role in metabolomics data analyses. Metabolomics datasets often contain outliers because of analytical, experimental, and biological ambiguity, but the currently available differential metabolite identification techniques are sensitive to outliers.ResultsWe propose a kernel weight based outlier-robust volcano plot for identifying differential metabolites from noisy metabolomics datasets. Two numerical experiments are used to evaluate the performance of the proposed technique against nine existing techniques, including the t-test and the Kruskal-Wallis test. Artificially generated data with outliers reveal that the proposed method results in a lower misclassification error rate and a greater area under the receiver operating characteristic curve compared with existing methods. An experimentally measured breast cancer dataset to which outliers were artificially added reveals that our proposed method produces only two non-overlapping differential metabolites whereas the other nine methods produced between seven and 57 non-overlapping differential metabolites.ConclusionOur data analyses show that the performance of the proposed differential metabolite identification technique is better than that of existing methods. Thus, the proposed method can contribute to analysis of metabolomics data with outliers. The R package and user manual of the proposed method are available at https://github.com/nishithkumarpaul/Rvolcano.

Highlights

  • The identification of differential metabolites in metabolomics is still a big challenge and plays a prominent role in metabolomics data analyses

  • The misclassification error rates (MERs) for differential metabolite identification were calculated for each method

  • The average MER, Area under the Receiver operating characteristic (ROC) curve (AUC) and Partial area under the ROC curve (pAUC) values for the artificial datasets are shown in the Additional file 1: Table S1

Read more

Summary

Introduction

The identification of differential metabolites in metabolomics is still a big challenge and plays a prominent role in metabolomics data analyses. Molecular omics studies- like genomics, transcriptomics, proteomics and metabolomics are playing a prominent role in life sciences, health and biological research [1]. Among these approaches, metabolomics is frequently used to understand biological metabolic status, making a direct link between genotypes and phenotypes [2]. Kumar et al BMC Bioinformatics (2018) 19:128 commonly used These platforms can simultaneously identify and quantify hundreds of metabolites. Subsequent metabolomics data analysis should consider the presence of these problems in the given data

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.