Abstract

We present a workflow using an ETD-optimised version of Mascot Percolator and a modified version of SLoMo (turbo-SLoMo) for analysis of phosphoproteomic data. We have benchmarked this against several database searching algorithms and phosphorylation site localisation tools and show that it offers highly sensitive and confident phosphopeptide identification and site assignment with PSM-level statistics, enabling rigorous comparison of data acquisition methods. We analysed the Plasmodium falciparum schizont phosphoproteome using for the first time, a data-dependent neutral loss-triggered-ETD (DDNL) strategy and a conventional decision-tree method. At a posterior error probability threshold of 0.01, similar numbers of PSMs were identified using both methods with a 73% overlap in phosphopeptide identifications. The false discovery rate associated with spectral pairs where DDNL CID/ETD identified the same phosphopeptide was < 1%. 72% of phosphorylation site assignments using turbo-SLoMo without any score filtering, were identical and 99.8% of these cases are associated with a false localisation rate of < 5%. We show that DDNL acquisition is a useful approach for phosphoproteomics and results in an increased confidence in phosphopeptide identification without compromising sensitivity or duty cycle. Furthermore, the combination of Mascot Percolator and turbo-SLoMo represents a robust workflow for phosphoproteomic data analysis using CID and ETD fragmentation.Biological significanceProtein phosphorylation is a ubiquitous post-translational modification that regulates protein function. Mass spectrometry-based approaches have revolutionised its analysis on a large-scale but phosphorylation sites are often identified by single phosphopeptides and therefore require more rigorous data analysis to unsure that sites are identified with high confidence for follow-up experiments to investigate their biological significance. The coverage and confidence of phosphoproteomic experiments can be enhanced by the use of multiple complementary fragmentation methods. Here we have benchmarked a data analysis pipeline for analysis of phosphoproteomic data generated using CID and ETD fragmentation and used it to demonstrate the utility of a data-dependent neutral loss triggered ETD fragmentation strategy for high confidence phosphopeptide identification and phosphorylation site localisation.

Highlights

  • Protein phosphorylation is the most studied post-translational modifications and it regulates most biological processes

  • We reanalysed published human phosphoproteomic data, which was acquired by sequential collision induced dissociation (CID) and electron transfer dissociation (ETD) analysis on an Orbitrap mass spectrometer [30], using Mascot Percolator. 5277 and 3543 phosphopeptide peptide spectrum match (PSM) (0.01 q-value threshold) were identified for CID and ETD datasets, respectively (Fig. 1)

  • This represents an increase in significant assignments of 45.5% and 53.1% compared to Mascot, 137.2% and 280.9% compared to SEQUEST and 84.8% and 304.5% compared to X! Tandem for CID and ETD data, respectively

Read more

Summary

Introduction

Protein phosphorylation is the most studied post-translational modifications and it regulates most biological processes. The majority of database searching algorithms were built to analyse data from CID spectra and over the years as alternative fragmentation methods have been developed, they have been modified to allow for the different ion series produced by these methods This integration of different fragmentation types has been implemented in post-search algorithms such as Mascot Percolator [17,18] and Protein Prospector [19], allowing scoring and accurate FDR calculations to be determined in a single pipeline regardless of fragmentation type. We set out to explore workflows for large scale phosphoproteomics using combinations of CID and ETD fragmentation and evaluation of a single data analysis pipeline comprising of Mascot Percolator and a modified version of the site localisation tool, SLoMo (turbo-SLoMo). We use the combined Mascot Percolator and turbo-SLoMo data analysis workflow to generate a high confidence P. falciparum schizont phosphoproteome

Preparation of parasites
IMAC purification
Data processing and database searching
Results and discussion
Large scale phosphopeptide identification using Mascot Percolator
Combinations of CID and ETD for phosphopeptide identification
Phosphorylation site assignment using turbo-SLoMo
Conclusions
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.