Abstract

BackgroundRejection of false positive peptide matches in database searches of shotgun proteomic experimental data is highly desirable. Several methods have been developed to use the peptide retention time as to refine and improve peptide identifications from database search algorithms. This report describes the implementation of an automated approach to reduce false positives and validate peptide matches.ResultsA robust linear regression based algorithm was developed to automate the evaluation of peptide identifications obtained from shotgun proteomic experiments. The algorithm scores peptides based on their predicted and observed reversed-phase liquid chromatography retention times. The robust algorithm does not require internal or external peptide standards to train or calibrate the linear regression model used for peptide retention time prediction. The algorithm is generic and can be incorporated into any database search program to perform automated evaluation of the candidate peptide matches based on their retention times. It provides a statistical score for each peptide match based on its retention time.ConclusionAnalysis of peptide matches where the retention time score was included resulted in a significant reduction of false positive matches with little effect on the number of true positives. Overall higher sensitivities and specificities were achieved for database searches carried out with MassMatrix, Mascot and X!Tandem after implementation of the retention time based score algorithm.

Highlights

  • Rejection of false positive peptide matches in database searches of shotgun proteomic experimental data is highly desirable

  • The goal is to improve the confidence in peptide identification and lower the number of false positive matches returned by database search programs

  • For Mascot research results, 66.85% of false positives were filtered with a threshold of 0.01 for CRT, whereas only 2.48% of true positives were filtered with the same threshold (Figure 7a)

Read more

Summary

Introduction

Rejection of false positive peptide matches in database searches of shotgun proteomic experimental data is highly desirable. Several methods have been developed to use the peptide retention time as to refine and improve peptide identifications from database search algorithms. High-performance liquid chromatography (HPLC) coupled with tandem mass spectrometry (LC-MS/MS) is most commonly used in shotgun proteomics to resolve and identify proteolytic peptides generated from complex protein mixtures [1]. Automated database searching and de novo sequencing algorithms are routinely used to convert the MS/MS data into peptide and protein identifications (page number not for citation purposes). Database search algorithms are more commonly used at this time due to their relatively low computational expense and higher compatibility with low mass accuracy and low quality MS/MS data [3,4]

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call