In-depth method assessments of differentially expressed protein detection for shotgun proteomics data with missing values

Jinxia Wang,Yunping Zhu,Tao Chen,Cheng Chang,Liwei Li,Jujuan Zhuang,Jie Ma

doi:10.1038/s41598-017-03650-8

Abstract

Considering as one of the major goals in quantitative proteomics, detection of the differentially expressed proteins (DEPs) plays an important role in biomarker selection and clinical diagnostics. There have been plenty of algorithms and tools focusing on DEP detection in proteomics research. However, due to the different application scopes of these methods, and various kinds of experiment designs, it is not very apparent about the best choice for large-scale proteomics data analyses. Moreover, given the fact that proteomics data usually contain high percentage of missing values (MVs), but few replicates, a systematic evaluation of the DEP detection methods combined with the MV imputation methods is essential and urgent. Here, we analyzed a total of four representative imputation methods and five DEP methods on different experimental and simulated datasets. The results showed that (i) MV imputation could not always improve the performances of DEP detection methods and the imputation effects differed in the missing value percentages; (ii) the DEP detection methods had different statistical powers affected by the percentage of MVs. Two statistical methods (i.e. the empirical Bayesian random censoring threshold model, and the significance analysis of microarray) performed better than the other evaluated methods in terms of accuracy and sensitivity.

Highlights

Due to the rapid improvement of high resolution mass spectrometers, the focus of proteomics research is changing from qualitative to quantitative analyses[1]
Even if some differentially expressed proteins (DEPs) detection methods might be applied to a dataset containing missing values (MVs), their statistical powers tend to be limited by the wide dynamic percentage of MVs in the proteomics data
Four popular imputation methods and five representative DEP detection methods were comprehensively evaluated on two experimental datasets and nine simulated datasets to answer three scientific questions: (1) What’s the maximum MV percentage of a dataset that imputation methods can handle? (2) To what extent, the imputation could affect the performances of the DEP detection methods? (3) Among the combinations of MV imputation and DEP detection methods, which one is more suitable for proteomics data?

Summary

Introduction

Due to the rapid improvement of high resolution mass spectrometers, the focus of proteomics research is changing from qualitative to quantitative analyses[1]. It is of great significance to accurately determine the protein expression levels and detect DEPs in different experimental conditions (groups or samples) in quantitative proteomics. Even if some DEP detection methods might be applied to a dataset containing MVs, their statistical powers tend to be limited by the wide dynamic percentage of MVs in the proteomics data. Webb-Robertson et al.[9] has reviewed some selected imputation methods for label-free quantitative proteomics, but the influences of these imputation strategies on the subsequent DEP detection algorithms were not considered. A systematic evaluation of DEP detection methods and MV imputation methods was performed for different experimental designs containing different replicates and MV percentages. Our aim is to evaluate the statistical powers of DEP detection methods before and after MV imputation. Four popular imputation methods and five representative DEP detection methods were comprehensively evaluated on two experimental datasets and nine simulated datasets to answer three scientific questions: (1) What’s the maximum MV percentage of a dataset that imputation methods can handle? (2) To what extent, the imputation could affect the performances of the DEP detection methods? (3) Among the combinations of MV imputation and DEP detection methods, which one is more suitable for proteomics data?

Objectives

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Reports	Publication Date: Jun 13, 2017
Citations: 30	License type: open-access

R Discovery Prime

R Discovery Prime

In-depth method assessments of differentially expressed protein detection for shotgun proteomics data with missing values

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

Dealing with missing values in proteomics data.
Weijia Kong ... Wilson Wen Bin Goh
PROTEOMICS | VOL. 22
Weijia Kong, et. al.Weijia Kong ... Wilson Wen Bin Goh
17 Nov 2022
PROTEOMICS | VOL. 22

Incomplete data ensemble classification using imputation-revision framework with local spatial neighborhood information
Yuanting Yan ... Yanping Zhang
Applied Soft Computing | VOL. 99
Yuanting Yan, et. al.Yuanting Yan ... Yanping Zhang
13 Nov 2020
Applied Soft Computing | VOL. 99

Editor's evaluation: Deep proteome profiling reveals signatures of age and sex differences in paw skin and sciatic nerve of naïve mice
Jungmin Choi
-
Jungmin ChoiJungmin Choi
17 Aug 2022
17 Aug 2022

RMisbeta: A robust missing value imputation approach in transcriptomics and metabolomics data
Md Shahjaman ... Md Nurul Haque Mollah
Computers in Biology and Medicine | VOL. 138
Md Shahjaman, et. al.Md Shahjaman ... Md Nurul Haque Mollah
29 Sep 2021
Computers in Biology and Medicine | VOL. 138

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

In-depth method assessments of differentially expressed protein detection for shotgun proteomics data with missing values

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports