Confident Peptide Identifications Research Articles

For bottom-up proteomic analysis, the goal of analytical pipelines that process the raw output of mass spectrometers is to detect, characterise, identify, and quantify peptides. The initial steps of detecting and characterising features in raw data must overcome considerable challenges. The data present as a sparse array, sometimes containing billions of intensity readings over time. These points represent both signal and chemical or electrical noise. Depending on the biological sample’s complexity, tens to hundreds of thousands of peptides may be present in this vast data landscape. For ion mobility-based LC-MS analysis, each peptide comprises a grouping of hundreds of single intensity readings in three dimensions: mass-over-charge (m/z), mobility, and retention time. There is no inherent information about any associations between individual points; whether they represent a peptide or noise must be inferred from their structure. Peptides each have multiple isotopes and different charge states, and their intensities span a dynamic range of over six orders of magnitude. Due to the high complexity of most biological samples, peptides often overlap in time and mobility, making it very difficult to tease apart isotopic peaks and to apportion the intensity of each and the contribution of each isotope to the determination of the peptide’s monoisotopic mass, which is critical for the peptide’s identification. Here we describe four algorithms for the Bruker timsTOF Pro that each play an important role in finding peptide features and determining their characteristics. These algorithms focus on separate characteristics that determine how candidate features are detected in the raw data. The first two algorithms deal with the complexity of the raw data, rapidly clustering raw data into spectra that allow isotopic peaks to be resolved. The third algorithm compensates for saturation of the instrument’s detector, thereby recovering lost dynamic range, and the fourth algorithm increases confidence in peptide identifications by simplifying the fragment spectra. These algorithms are effective in processing raw data to detect features and extract the attributes required for peptide identification, and they make an important contribution to an analytical pipeline by detecting features that are of higher quality and better segmented from other peptides in close proximity. The software has been developed in Python using NumPy and pandas and made freely available under an open-source MIT license to facilitate experimentation and further improvement (DOI 10.5281/zenodo.6513126). Data are available via ProteomeXchange with identifier PXD030706.
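
To illustrate why resolved isotopic peaks matter for the monoisotopic mass, the following is a minimal sketch and not the published software. It assumes the isotopic peaks of one peptide have already been separated, infers the charge state from their m/z spacing (roughly 1.00335/z), and converts the monoisotopic m/z to a neutral mass. The function names, the tolerance, and the example peak values are hypothetical.

```python
# Illustrative sketch only; names and tolerance are assumptions, not the paper's code.
PROTON_MASS = 1.007276466   # Da
C13_C12_DELTA = 1.0033548   # Da, mass difference between 13C and 12C


def infer_charge(isotope_mzs, max_charge=5, tol=0.01):
    """Guess the charge state from the mean spacing of sorted isotopic peaks."""
    spacings = [b - a for a, b in zip(isotope_mzs, isotope_mzs[1:])]
    if not spacings:
        return None  # a single peak carries no spacing information
    mean_spacing = sum(spacings) / len(spacings)
    for z in range(1, max_charge + 1):
        # Isotopic peaks of a z-charged ion sit about C13_C12_DELTA / z apart in m/z.
        if abs(mean_spacing - C13_C12_DELTA / z) <= tol:
            return z
    return None  # spacing matches no charge state up to max_charge


def monoisotopic_mass(mono_mz, charge):
    """Neutral monoisotopic mass of a protonated ion observed at mono_mz."""
    return (mono_mz - PROTON_MASS) * charge


# Hypothetical doubly charged isotope envelope starting at m/z 500.0
peaks = [500.0 + i * C13_C12_DELTA / 2 for i in range(3)]
charge = infer_charge(peaks)
mass = monoisotopic_mass(peaks[0], charge)
```

When peptides overlap, a peak from a neighbouring peptide can be mistaken for the monoisotopic peak or distort the spacing, which is the failure mode the abstract's segmentation steps aim to reduce.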


One of the most important early developments in the field of proteomics was the advent of automated data acquisition routines that allowed high-throughput unattended data acquisition during HPLC introduction of peptide mixtures to a tandem mass spectrometer. Prior to this, data acquisition was orders of magnitude less efficient, being based entirely on lists of predetermined ions generated in a prior HPLC-MS experiment. This process, known generically as data-dependent analysis, empowered the development of shotgun proteomics, where hundreds to thousands of peptide sequences are matched per experiment. In its most popular implementation, the most abundant ionized species from every precursor ion scan at each moment in chromatographic time are successively selected for isolation, activation and tandem mass analysis. While extremely powerful, this strategy has one primary limitation: detectable dynamic range is restricted (in a top-down manner) to the peptides that ionize the best. To circumvent the serial nature of the data-dependent process and increase detectable dynamic range, the concepts of multiplexed and data-independent acquisition (DIA) have emerged. Multiplexed data acquisition is based on more efficient co-selection and co-dissociation of multiple precursor ions in parallel, the data from which are subsequently deconvoluted to provide polypeptide sequences for each individual precursor ion. DIA has similar goals, but there is no real-time ion selection based on prior precursor ion scans. Instead, predefined m/z ranges are interrogated either by fragmenting all ions entering the mass spectrometer at every single point in chromatographic time, or by dividing the m/z range into smaller m/z ranges for isolation and fragmentation. These approaches aim to fully utilize the capabilities of mass spectrometers to maximize tandem MS acquisition time and to address the need to expand the detectable dynamic range, lower the limit of detection, and improve the overall confidence of peptide identifications and relative protein quantification measurements. This review covers all aspects of multiplexed and data-independent tandem mass spectrometry in proteomics, from experimental implementations to advances in software for data interpretation.
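
The contrast between the two acquisition strategies can be sketched in a few lines. This is a simplified illustration under stated assumptions, not any instrument's acquisition logic: `top_n_precursors` mimics data-dependent selection of the most abundant ions in one precursor scan, and `dia_windows` mimics dividing a predefined m/z range into fixed isolation windows. The function names, the scan values, and the window width are all hypothetical.

```python
# Illustrative sketch only; function names and values are assumptions.

def top_n_precursors(scan, n):
    """Data-dependent style: pick the n most intense (mz, intensity) peaks."""
    return sorted(scan, key=lambda peak: peak[1], reverse=True)[:n]


def dia_windows(mz_start, mz_end, width):
    """Data-independent style: tile [mz_start, mz_end) with fixed-width windows."""
    windows = []
    lo = mz_start
    while lo < mz_end:
        hi = min(lo + width, mz_end)  # the last window may be narrower
        windows.append((lo, hi))
        lo = hi
    return windows


# Hypothetical precursor scan: (m/z, intensity) pairs
scan = [(400.2, 1e4), (512.3, 5e5), (623.8, 2e3), (701.4, 8e4)]

selected = top_n_precursors(scan, n=2)   # only the two most abundant ions
windows = dia_windows(400, 1000, 200)    # every ion falls in some window

# Ions that would be fragmented under the window scheme, regardless of abundance
covered = [mz for mz, _ in scan if any(lo <= mz < hi for lo, hi in windows)]
```

In this toy scan the low-abundance ion at m/z 623.8 is never chosen by the top-2 selection but is still fragmented under the window scheme, which reflects the dynamic-range argument made in the abstract.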



Articles published on Confident Peptide Identifications

Utilizing Precursor Ion Connectivity of Different Charge States to Improve Peptide and Protein Identification in MS/MS Analysis.

Simplifying MS1 and MS2 spectra to achieve lower mass error, more dynamic range, and higher peptide identification confidence on the Bruker timsTOF Pro.

TIDD: tool-independent and data-dependent machine learning for peptide identification

MS-Decipher: a user-friendly proteome database search software with an emphasis on deciphering the spectra of O-linked glycopeptides.

Targeted Mass Spectrometry Analysis of Protein Phosphorylation by Selected Ion Monitoring Coupled to Parallel Reaction Monitoring (tSIM/PRM).

Escherichia coli and Sf9 Contaminant Databases to Increase Efficiency of Tandem Mass Spectrometry Peptide Identification in Structural Mass Spectrometry Experiments.

Probabilistic Limit of Detection for Ricin Identification Using a Shotgun Proteomics Assay.

ETD-Cleavable Linker for Confident Cross-linked Peptide Identifications.

Active Instrument Engagement Combined with a Real-Time Database Search for Improved Performance of Sample Multiplexing Workflows.

Confidence assignment for mass spectrometry based peptide identifications via the extreme value distribution.

Preserved Proteins from Extinct Bison latifrons Identified by Tandem Mass Spectrometry; Hydroxylysine Glycosides are a Common Feature of Ancient Collagen

Conserved Peptide Fragmentation as a Benchmarking Tool for Mass Spectrometers and a Discriminating Feature for Targeted Proteomics

Prediction of Peptide Fragment Ion Mass Spectra by Data Mining Techniques

Characterization of intact N- and O-linked glycopeptides using higher energy collisional dissociation

Multiplexed and data‐independent tandem mass spectrometry for global proteome profiling

HCD-only fragmentation method balances peptide identification and quantitation of TMT-labeled samples in hybrid linear ion trap/orbitrap mass spectrometers

Optimized Nonlinear Gradients for Reversed-Phase Liquid Chromatography in Shotgun Proteomics

Improving Qualitative and Quantitative Performance for MSE-based Label-free Proteomics

Properties of isotope patterns and their utility for peptide identification in large‐scale proteomic experiments

Data-Dependent Middle-Down Nano-Liquid Chromatography–Electron Capture Dissociation-Tandem Mass Spectrometry: An Application for the Analysis of Unfractionated Histones
