Targeted Feature Detection for Data-Dependent Shotgun Proteomics

Hendrik Weisser,Jyoti S Choudhary

doi:10.1021/acs.jproteome.7b00248

Hendrik Weisser, Jyoti S Choudhary

Open Access

https://doi.org/10.1021/acs.jproteome.7b00248

Copy DOI

Journal: Journal of Proteome Research	Publication Date: Jul 19, 2017
Citations: 44	License type: cc-by

Affiliation: Wellcome Sanger Institute

Abstract

Label-free quantification of shotgun LC–MS/MS data is the prevailing approach in quantitative proteomics but remains computationally nontrivial. The central data analysis step is the detection of peptide-specific signal patterns, called features. Peptide quantification is facilitated by associating signal intensities in features with peptide sequences derived from MS2 spectra; however, missing values due to imperfect feature detection are a common problem. A feature detection approach that directly targets identified peptides (minimizing missing values) but also offers robustness against false-positive features (by assigning meaningful confidence scores) would thus be highly desirable. We developed a new feature detection algorithm within the OpenMS software framework, leveraging ideas and algorithms from the OpenSWATH toolset for DIA/SRM data analysis. Our software, FeatureFinderIdentification (“FFId”), implements a targeted approach to feature detection based on information from identified peptides. This information is encoded in an MS1 assay library, based on which ion chromatogram extraction and detection of feature candidates are carried out. Significantly, when analyzing data from experiments comprising multiple samples, our approach distinguishes between “internal” and “external” (inferred) peptide identifications (IDs) for each sample. On the basis of internal IDs, two sets of positive (true) and negative (decoy) feature candidates are defined. A support vector machine (SVM) classifier is then trained to discriminate between the sets and is subsequently applied to the “uncertain” feature candidates from external IDs, facilitating selection and confidence scoring of the best feature candidate for each peptide. This approach also enables our algorithm to estimate the false discovery rate (FDR) of the feature selection step. We validated FFId based on a public benchmark data set, comprising a yeast cell lysate spiked with protein standards that provide a known ground-truth. The algorithm reached almost complete (>99%) quantification coverage for the full set of peptides identified at 1% FDR (PSM level). Compared with other software solutions for label-free quantification, this is an outstanding result, which was achieved at competitive quantification accuracy and reproducibility across replicates. The FDR for the feature selection was estimated at a low 1.5% on average per sample (3% for features inferred from external peptide IDs). The FFId software is open-source and freely available as part of OpenMS (www.openms.org).

Highlights

Shotgun proteomics enables the identification and quantification of proteins in complex mixtures in a high-throughput fashion.[1]
We present here a new algorithm for targeted feature detection, termed FeatureFinderIdentification (FFId), that we developed within the OpenMS software framework.[15]
Shotgun proteomics based on dependent acquisition (DDA) is a very mature technology and the method of choice for unbiased, discovery-driven studies of the proteome

Summary

Introduction

Shotgun proteomics enables the identification and quantification of proteins in complex mixtures in a high-throughput fashion.[1] After enzymatic digestion of the proteins, resulting peptides are analyzed by liquid chromatography coupled to tandem mass spectrometry (LC−MS/MS). In data-dependent acquisition (DDA) mode, the MS instrument acquires precursor ion spectra (MS1) and selects candidate precursors based on charge and intensity for fragmentation and fragment ion spectra (MS2) acquisition. To quantify peptides in LC− MS/MS data and thereby enable the inference of protein abundances, the identities of peptides and the corresponding quantitative measures both have to be available. The quantitative information is contained in the MS1 spectra, and sequence information required for identification is captured in the MS2 spectra. DDA allows for a discovery-driven approach to protein identification and quantification that requires only minimal preparation or a priori knowledge of the sample

Objectives

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Targeted Feature Detection for Data-Dependent Shotgun Proteomics

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Proteome Research

Lead the way for us

Similar Papers

Normalization and Statistical Analysis of Quantitative Proteomics Data Generated by Metabolic Labeling
Lily Ting ... Ricardo Cavicchioli
Molecular & Cellular Proteomics | VOL. 8
Lily Ting, et. al.Lily Ting ... Ricardo Cavicchioli
01 Oct 2009
Molecular & Cellular Proteomics | VOL. 8

Defining, Comparing, and Improving iTRAQ Quantification in Mass Spectrometry Proteomics Data
Lina Hultin-Rosenberg ... Henrik J Johansson
Molecular & Cellular Proteomics | VOL. 12
Lina Hultin-Rosenberg, et. al.Lina Hultin-Rosenberg ... Henrik J Johansson
01 Jul 2013
Molecular & Cellular Proteomics | VOL. 12

Targeted Quantitative Analysis of Streptococcus pyogenes Virulence Factors by Multiple Reaction Monitoring
Vinzenz Lange ... Ruedi Aebersold
Molecular & cellular proteomics : MCP | VOL. 7
Vinzenz Lange, et. al.Vinzenz Lange ... Ruedi Aebersold
01 Aug 2008
Molecular & cellular proteomics : MCP | VOL. 7

Combining Results of Multiple Search Engines in Proteomics
David Shteynberg ... Eric W Deutsch
Molecular & Cellular Proteomics | VOL. 12
David Shteynberg, et. al.David Shteynberg ... Eric W Deutsch
01 Sep 2013
Molecular & Cellular Proteomics | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Targeted Feature Detection for Data-Dependent Shotgun Proteomics

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Proteome Research