Abstract

We describe pLink 2, a search engine with higher speed and reliability for proteome-scale identification of cross-linked peptides. With a two-stage open search strategy facilitated by fragment indexing, pLink 2 is ~40 times faster than pLink 1 and 3~10 times faster than Kojak. Furthermore, using simulated datasets, synthetic datasets, 15N metabolically labeled datasets, and entrapment databases, four analysis methods were designed to evaluate the credibility of ten state-of-the-art search engines. This systematic evaluation shows that pLink 2 outperforms these methods in precision and sensitivity, especially at proteome scales. Lastly, re-analysis of four published proteome-scale cross-linking datasets with pLink 2 required only a fraction of the time used by pLink 1, with up to 27% more cross-linked residue pairs identified. pLink 2 is therefore an efficient and reliable tool for cross-linking mass spectrometry analysis, and the systematic evaluation methods described here will be useful for future software development.

Highlights

  • We describe pLink 2, a search engine with higher speed and reliability for proteome-scale identification of cross-linked peptides

  • The idea of CXMS had long existed for structural interpretation of proteins, but its practice had been hindered by the lack of software tools

  • We show that the proposed four target-decoy approach (TDA)-independent evaluation methods are indispensable for systematic evaluation of CXMS search engines

Read more

Summary

Introduction

We describe pLink 2, a search engine with higher speed and reliability for proteome-scale identification of cross-linked peptides. Using simulated datasets, synthetic datasets, 15N metabolically labeled datasets, and entrapment databases, four analysis methods were designed to evaluate the credibility of ten state-of-the-art search engines This systematic evaluation shows that pLink 2 outperforms these methods in precision and sensitivity, especially at proteome scales. The n-square problem was tackled by the open search strategy, which considers one cross-linked peptide pair as two single peptides, each bearing a modification of large mass yet unknown composition on linkable residues. This strategy identifies candidates for two single peptides individually and recombine the top scored single peptides into cross-linked pairs based on the known mass of precursor[10,12,13,14,17]. As we proposed earlier, a fragment index was introduced to reduce the number of coarse-scored peptides[30]

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call