Covariation of Peptide Abundances Accurately Reflects Protein Concentration Differences

Bo Zhang,Mohammad Pirmoradian,Roman Zubarev,Lukas Käll

doi:10.1074/mcp.o117.067728

Abstract

Most implementations of mass spectrometry-based proteomics involve enzymatic digestion of proteins, expanding the analysis to multiple proteolytic peptides for each protein. Currently, there is no consensus of how to summarize peptides' abundances to protein concentrations, and such efforts are complicated by the fact that error control normally is applied to the identification process, and do not directly control errors linking peptide abundance measures to protein concentration. Peptides resulting from suboptimal digestion or being partially modified are not representative of the protein concentration. Without a mechanism to remove such unrepresentative peptides, their abundance adversely impacts the estimation of their protein's concentration. Here, we present a relative quantification approach, Diffacto, that applies factor analysis to extract the covariation of peptides' abundances. The method enables a weighted geometrical average summarization and automatic elimination of incoherent peptides. We demonstrate, based on a set of controlled label-free experiments using standard mixtures of proteins, that the covariation structure extracted by the factor analysis accurately reflects protein concentrations. In the 1% peptide-spectrum match-level FDR data set, as many as 11% of the peptides have abundance differences incoherent with the other peptides attributed to the same protein. If not controlled, such contradicting peptide abundance have a severe impact on protein quantifications. When adding the quantities of each protein's three most abundant peptides, we note as many as 14% of the proteins being estimated as having a negative correlation with their actual concentration differences between samples. Diffacto reduced the amount of such obviously incorrectly quantified proteins to 1.6%. Furthermore, by analyzing clinical data sets from two breast cancer studies, our method revealed the persistent proteomic signatures linked to three subtypes of breast cancer. We conclude that Diffacto can facilitate the interpretation and enhance the utility of most types of proteomics data.

Highlights

Because a typical experiment involves proteolytic digestion, the actual analytes measured by liquid chromatography-tandem mass spectrometry (LC-MS/MS)1 are the proteolytic peptides of the analyzed proteins
The transformation of abundance scale balanced the contributions of peptides for each protein, despite the vastly different ion-intensities observed in LC-MS/MS experiments
Based on the proportionality principle, every observed peptide abundance should be a combination of two parts: the signal responding to the relative change of protein concentration (z), plus the noise (␧) mainly caused by measurement errors

Summary

Introduction

By putting more trust in peptides that demonstrate a stronger covariation with the other peptides from the same protein, one can make better use of the proportionality principle Utilizing such information about covariation, other approaches have been shown to improve the validity of protein inference and signal integration (16 –18), or provide a basis for selecting peptides for quantitative analysis [19, 20]. These approaches have drawbacks in terms of dependences toward specific quantification techniques or the difficulty with handling missing values; and often incorrectly treat all peptides as independent variables when summarizing each individual LC-MS/MS experiment

Objectives

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Molecular & Cellular Proteomics	Publication Date: May 1, 2017
Citations: 69	License type: cc-by

R Discovery Prime

R Discovery Prime

Covariation of Peptide Abundances Accurately Reflects Protein Concentration Differences

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Molecular & Cellular Proteomics

Lead the way for us

Similar Papers

Statistical Approach to Protein Quantification
Sarah Gerster ... Peter Bühlmann
Molecular & Cellular Proteomics | VOL. 13
Sarah Gerster, et. al.Sarah Gerster ... Peter Bühlmann
01 Feb 2014
Molecular & Cellular Proteomics | VOL. 13

Simultaneous Integration of Multi-omics Data Improves the Identification of Cancer Driver Modules.
Dana Silverbush ... Simona Cristea
Cell Systems | VOL. 8
Dana Silverbush, et. al.Dana Silverbush ... Simona Cristea
01 May 2019
Cell Systems | VOL. 8

Abstract PR12: Directing personalized breast cancer treatment with pathway signatures
Jing-Ru Jhan ... Eran R Andrechek
Molecular Cancer Research | VOL. 12
Jing-Ru Jhan, et. al.Jing-Ru Jhan ... Eran R Andrechek
01 Nov 2014
Molecular Cancer Research | VOL. 12

Abstract A44: Directing personalized breast cancer treatment with pathway signatures
Jing-Ru Jhan ... Eran Andrechek
Molecular Cancer Research | VOL. 12
Jing-Ru Jhan, et. al.Jing-Ru Jhan ... Eran Andrechek
01 Nov 2014
Molecular Cancer Research | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Covariation of Peptide Abundances Accurately Reflects Protein Concentration Differences

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Molecular &amp; Cellular Proteomics

More From: Molecular & Cellular Proteomics