A Novel Ultrahigh-Resolution Y-Injection Multireflecting Time-of-Flight Mass Spectrometer for Bottom-Up Proteomics.
The first results of using a new type of ultrahigh-resolution mass analyzer based on a planar multipass time-of-flight mass spectrometer with periodic reflecting lenses (Y-MRT MS) for bottom-up whole-proteome analysis are presented. The instrument achieves a resolving power in a range of 600,000-800,000 for peptide ions across the whole m/z range, with a high repetition rate of 300 Hz (averaged to 0.5-4 Hz for enhanced dynamic range). In preliminary experiments for human cell lines, MCF-7 and HeLa, single-shot 30 min gradient HPLC separations of 1 μg proteolytic digests yielded, on average, over 4000 protein groups in MS/MS-free proteome analyses using the DirectMS1 method. Combining three technical runs increased these numbers to 4500 protein groups at 1% FDR. Peptide ion mass measurements demonstrated an accuracy of 70-130 ppb across the whole m/z range, with a dynamic range exceeding 104. In DIA mode (SWATH-DIA, 20 Th window, 30 min gradient), 4350 protein IDs were obtained at 1% FDR on average in single-shot LC-MS/MS runs. These results highlight the Y-MRT mass analyzer's potential for bottom-up proteomics. Further improvements in proteome coverage and analysis time are anticipated with optimized HPLC configurations and the integration of gas-phase ion mobility separation.
- Research Article
93
- 10.1016/j.cels.2022.02.003
- Mar 16, 2022
- Cell systems
SUMMARYSingle-cell proteomics (scProteomics) promises to advance our understanding of cell functions within complex biological systems. However, a major challenge of current methods is their inability to identify and provide accurate quantitative information for low-abundance proteins. Herein, we describe an ion-mobility-enhanced mass spectrometry acquisition and peptide identification method, transferring identification based on FAIMS filtering (TIFF), to improve the sensitivity and accuracy of label-free scProteomics. TIFF extends the ion accumulation times for peptide ions by filtering out singly charged ions. The peptide identities are assigned by a three-dimensional MS1 feature matching approach (retention time, accurate mass, and FAIMS compensation voltage). The TIFF method enabled unbiased proteome analysis to a depth of >1,700 proteins in single HeLa cells, with >1,100 proteins consistently identified. As a demonstration, we applied the TIFF method to obtain temporal proteome profiles of >150 single murine macrophage cells during lipopolysaccharide stimulation and identified time-dependent proteome changes. A record of this paper’s transparent peer review process is included in the supplemental information.
- Research Article
306
- 10.1074/mcp.r500005-mcp200
- Mar 1, 2005
- Molecular & Cellular Proteomics
The combined method of LC-MS/MS is increasingly being used to explore differences in the proteomic composition of complex biological systems. The reliability and utility of such comparative protein expression profiling studies is critically dependent on an accurate and rigorous assessment of quantitative changes in the relative abundance of the myriad of proteins typically present in a biological sample such as blood or tissue. In this review, we provide an overview of key statistical and computational issues relevant to bottom-up shotgun global proteomic analysis, with an emphasis on methods that can be applied to improve the dependability of biological inferences drawn from large proteomic datasets. Focusing on a start-to-finish approach, we address the following topics: 1) low-level data processing steps, such as formation of a data matrix, filtering, and baseline subtraction to minimize noise, 2) mid-level processing steps, such as data normalization, alignment in time, peak detection, peak quantification, peak matching, and error models, to facilitate profile comparisons; and, 3) high-level processing steps such as sample classification and biomarker discovery, and related topics such as significance testing, multiple testing, and choice of feature space. We report on approaches that have recently been developed for these steps, discussing their merits and limitations, and propose areas deserving of further research.
- Supplementary Content
372
- 10.1074/mcp.m600162-mcp200
- Oct 1, 2006
- Molecular & Cellular Proteomics
Recent advances in proteomics technologies provide tremendous opportunities for biomarker-related clinical applications; however, the distinctive characteristics of human biofluids such as the high dynamic range in protein abundances and extreme complexity of the proteomes present tremendous challenges. In this review we summarize recent advances in LC-MS-based proteomics profiling and its applications in clinical proteomics as well as discuss the major challenges associated with implementing these technologies for more effective candidate biomarker discovery. Developments in immunoaffinity depletion and various fractionation approaches in combination with substantial improvements in LC-MS platforms have enabled the plasma proteome to be profiled with considerably greater dynamic range of coverage, allowing many proteins at low ng/ml levels to be confidently identified. Despite these significant advances and efforts, major challenges associated with the dynamic range of measurements and extent of proteome coverage, confidence of peptide/protein identifications, quantitation accuracy, analysis throughput, and the robustness of present instrumentation must be addressed before a proteomics profiling platform suitable for efficient clinical applications can be routinely implemented.
- Supplementary Content
168
- 10.1074/mcp.m600068-mcp200
- Oct 1, 2006
- Molecular & Cellular Proteomics
Although human plasma represents an attractive sample for disease biomarker discovery, the extreme complexity and large dynamic range in protein concentrations present significant challenges for characterization, candidate biomarker discovery, and validation. Herein we describe a strategy that combines immunoaffinity subtraction and subsequent chemical fractionation based on cysteinyl peptide and N-glycopeptide captures with two-dimensional LC-MS/MS to increase the dynamic range of analysis for plasma. Application of this "divide-and-conquer" strategy to trauma patient plasma significantly improved the overall dynamic range of detection and resulted in confident identification of 22,267 unique peptides from four different peptide populations (cysteinyl peptides, non-cysteinyl peptides, N-glycopeptides, and non-glycopeptides) that covered 3,654 different proteins with 1,494 proteins identified by multiple peptides. Numerous low abundance proteins were identified, exemplified by 78 "classic" cytokines and cytokine receptors and by 136 human cell differentiation molecules. Additionally a total of 2,910 different N-glycopeptides that correspond to 662 N-glycoproteins and 1,553 N-glycosylation sites were identified. A panel of the proteins identified in this study is known to be involved in inflammation and immune responses. This study established an extensive reference protein database for trauma patients that provides a foundation for future high throughput quantitative plasma proteomic studies designed to elucidate the mechanisms that underlie systemic inflammatory responses.
- Research Article
322
- 10.1074/jbc.m403672200
- Aug 1, 2004
- Journal of Biological Chemistry
The presence of amyloid plaques in the brain is one of the pathological hallmarks of Alzheimer's disease (AD). We report here a comprehensive proteomic analysis of senile plaques from postmortem AD brain tissues. Senile plaques labeled with thioflavin-S were procured by laser capture microdissection, and their protein components were analyzed by liquid chromatography coupled with tandem mass spectrometry. We identified a total of 488 proteins co-isolated with the plaques, and we found multiple phosphorylation sites on the neurofilament intermediate chain, implicating the complexity and diversity of cellular processes involved in the plaque formation. More significantly, we identified 26 proteins enriched in the plaques of two AD cases by quantitative comparison with surrounding non-plaque tissues. The localization of several proteins in the plaques was further confirmed by the approach of immunohistochemistry. In addition to previously identified plaque constituents, we discovered novel association of dynein heavy chain with the plaques in human postmortem brain and in a double transgenic AD mouse model, suggesting that neuronal transport may play a role in neuritic degeneration. Overall, our results revealed for the first time the sub-proteome of amyloid plaques that is important for further studies on disease biomarker identification and molecular mechanisms of AD pathogenesis.
- Research Article
45
- 10.1074/mcp.m110.002774
- Dec 1, 2010
- Molecular & Cellular Proteomics
Liquid chromatography-tandem mass spectrometry (LC-MS/MS)-based proteomics provides a wealth of information about proteins present in biological samples. In bottom-up LC-MS/MS-based proteomics, proteins are enzymatically digested into peptides prior to query by LC-MS/MS. Thus, the information directly available from the LC-MS/MS data is at the peptide level. If a protein-level analysis is desired, the peptide-level information must be rolled up into protein-level information. We propose a principal component analysis-based statistical method, ProPCA, for efficiently estimating relative protein abundance from bottom-up label-free LC-MS/MS data that incorporates both spectral count information and LC-MS peptide ion peak attributes, such as peak area, volume, or height. ProPCA may be used effectively with a variety of quantification platforms and is easily implemented. We show that ProPCA outperformed existing quantitative methods for peptide-protein roll-up, including spectral counting methods and other methods for combining LC-MS peptide peak attributes. The performance of ProPCA was validated using a data set derived from the LC-MS/MS analysis of a mixture of protein standards (the UPS2 proteomic dynamic range standard introduced by The Association of Biomolecular Resource Facilities Proteomics Standards Research Group in 2006). Finally, we applied ProPCA to a comparative LC-MS/MS analysis of digested total cell lysates prepared for LC-MS/MS analysis by alternative lysis methods and show that ProPCA identified more differentially abundant proteins than competing methods.
- Abstract
- 10.1016/j.fertnstert.2006.07.1100
- Sep 1, 2006
- Fertility and Sterility
P-716: Proteomics of human follicular fluid
- Research Article
102
- 10.1074/jbc.r111.239442
- Jul 1, 2011
- Journal of Biological Chemistry
The diverse proteome of an organism arises from such events as single nucleotide substitutions at the DNA level, different RNA processing, and dynamic enzymatic post-translational modifications. This minireview focuses on the measurement of intact proteins to describe the diversity found in proteomes. The field of biological mass spectrometry has steadily advanced, enabling improvements in the characterization of single proteins to proteins derived from cells or tissues. In this minireview, we discuss the basic technology for "top-down" intact protein analysis. Furthermore, examples of studies involved with the qualitative and quantitative analysis of full-length polypeptides are provided.
- Research Article
7
- 10.1161/circgenetics.110.957761
- Jun 1, 2012
- Circulation: Cardiovascular Genetics
Clinical proteomics involves the analysis of protein expression of disease proteomes, with the aim of solving a specific clinical problem. Discoveries made from proteomicbased studies contribute to the growing need for innovative medical diagnostics for disease detection. Taking into consideration the global health burden of cardiac disease, clinical proteomics is a valuable tool to improve risk stratification associated with this disease. In cardiovascular medicine, the identification of novel proteins, or biomarkers, that are differentially expressed in cardiac disease proteomes may enable early detection of the disease state, thereby preventing progression to disease end points. This review outlines various proteomic platforms and their technical advancements and relates these to the cardiovascular sciences. The entire protein complement of the cell, or proteome, is dynamic and changes in response to the disease state. 1 Proteomic-based experiments can be used to characterize such alterations in protein expression during disease progression. 2 With combined improvements in mass spectrometry (MS) technology as well as innovative molecular biology screening tools, there has been widespread growth in the characterization of cardiac disease proteomes. In fact, proteomic technology has been an important tool in the analysis of heart failure (HF), 3,4 cardiac hypertrophy, 5,6 and dilated cardiomyopathy.7 Several of the overarching aims of such studies include providing greater understanding of general biological mechanisms as well as identifying unique proteins that are clinically useful in the detection of cardiac disease in the early stages or potentially used as novel therapeutic targets. Clinical proteomics continues to benefit from advancements in technologies that allow for fast and consistent identification of proteins with corresponding increases in the dynamic range of proteins detectable in the disease proteome. MS-related technologies have improved in their ability to detect low-abundance proteins as well as membrane proteins and have benefited from sample preprocessing strategies that decrease the complexity of large-scale analyses. With the emergence of different methodologies for biomarker identification, the following 3 criteria must be taken into careful consideration to determine whether the protein is, in fact,
- Research Article
1553
- 10.1074/mcp.t500030-mcp200
- Oct 24, 2005
- Molecular & Cellular Proteomics
Mass accuracy is a key parameter of mass spectrometric performance. TOF instruments can reach low parts per million, and FT-ICR instruments are capable of even greater accuracy provided ion numbers are well controlled. Here we demonstrate sub-ppm mass accuracy on a linear ion trap coupled via a radio frequency-only storage trap (C-trap) to the orbitrap mass spectrometer (LTQ Orbitrap). Prior to acquisition of a spectrum, a background ion originating from ambient air is first transferred to the C-trap. Ions forming the MS or MS(n) spectrum are then added to this species, and all ions are injected into the orbitrap for analysis. Real time recalibration on the "lock mass" by corrections of mass shift removes mass error associated with calibration of the mass scale. The remaining mass error is mainly due to imperfect peaks caused by weak signals and is addressed by averaging the mass measurement over the LC peak, weighted by signal intensity. For peptide database searches in proteomics, we introduce a variable mass tolerance and achieve average absolute mass deviations of 0.48 ppm (standard deviation 0.38 ppm) and maximal deviations of less than 2 ppm. For tandem mass spectra we demonstrate similarly high mass accuracy and discuss its impact on database searching. High and routine mass accuracy in a compact instrument will dramatically improve certainty of peptide and small molecule identification.
- Research Article
21
- 10.31635/ccschem.022.202202333
- Oct 22, 2022
- CCS Chemistry
Recent Advances in Single-Cell Metabolomics Based on Mass Spectrometry
- Research Article
56
- 10.1074/mcp.m600320-mcp200
- Jan 1, 2007
- Molecular & Cellular Proteomics
A major limitation in identifying peptides from complex mixtures by shotgun proteomics is the ability of search programs to accurately assign peptide sequences using mass spectrometric fragmentation spectra (MS/MS spectra). Manual analysis is used to assess borderline identifications; however, it is error-prone and time-consuming, and criteria for acceptance or rejection are not well defined. Here we report a Manual Analysis Emulator (MAE) program that evaluates results from search programs by implementing two commonly used criteria: 1) consistency of fragment ion intensities with predicted gas phase chemistry and 2) whether a high proportion of the ion intensity (proportion of ion current (PIC)) in the MS/MS spectra can be derived from the peptide sequence. To evaluate chemical plausibility, MAE utilizes similarity (Sim) scoring against theoretical spectra simulated by MassAnalyzer software (Zhang, Z. (2004) Prediction of low-energy collision-induced dissociation spectra of peptides. Anal. Chem. 76, 3908-3922) using known gas phase chemical mechanisms. The results show that Sim scores provide significantly greater discrimination between correct and incorrect search results than achieved by Sequest XCorr scoring or Mascot Mowse scoring, allowing reliable automated validation of borderline cases. To evaluate PIC, MAE simplifies the DTA text files summarizing the MS/MS spectra and applies heuristic rules to classify the fragment ions. MAE output also provides data mining functions, which are illustrated by using PIC to identify spectral chimeras, where two or more peptide ions were sequenced together, as well as cases where fragmentation chemistry is not well predicted.
- Research Article
219
- 10.1074/mcp.m111.009431
- May 9, 2011
- Molecular & Cellular Proteomics
This article provides an introduction to Fourier transform-based mass spectrometry. The key performance characteristics of Fourier transform-based mass spectrometry, mass accuracy and resolution, are presented in the view of how they impact the interpretation of measurements in proteomic applications. The theory and principles of operation of two types of mass analyzer, Fourier transform ion cyclotron resonance and Orbitrap, are described. Major benefits as well as limitations of Fourier transform-based mass spectrometry technology are discussed in the context of practical sample analysis, and illustrated with examples included as figures in this text and in the accompanying slide set. Comparisons highlighting the performance differences between the two mass analyzers are made where deemed useful in assisting the user with choosing the most appropriate technology for an application. Recent developments of these high-performing mass spectrometers are mentioned to provide a future outlook.
- Research Article
80
- 10.1074/mcp.m115.055384
- Mar 1, 2016
- Molecular & Cellular Proteomics
All large scale LC-MS/MS post-translational methylation site discovery experiments require methylpeptide spectrum matches (methyl-PSMs) to be identified at acceptably low false discovery rates (FDRs). To meet estimated methyl-PSM FDRs, methyl-PSM filtering criteria are often determined using the target-decoy approach. The efficacy of this methyl-PSM filtering approach has, however, yet to be thoroughly evaluated. Here, we conduct a systematic analysis of methyl-PSM FDRs across a range of sample preparation workflows (each differing in their exposure to the alcohols methanol and isopropyl alcohol) and mass spectrometric instrument platforms (each employing a different mode of MS/MS dissociation). Through (13)CD3-methionine labeling (heavy-methyl SILAC) of Saccharomyces cerevisiae cells and in-depth manual data inspection, accurate lists of true positive methyl-PSMs were determined, allowing methyl-PSM FDRs to be compared with target-decoy approach-derived methyl-PSM FDR estimates. These results show that global FDR estimates produce extremely unreliable methyl-PSM filtering criteria; we demonstrate that this is an unavoidable consequence of the high number of amino acid combinations capable of producing peptide sequences that are isobaric to methylated peptides of a different sequence. Separate methyl-PSM FDR estimates were also found to be unreliable due to prevalent sources of false positive methyl-PSMs that produce high peptide identity score distributions. Incorrect methylation site localizations, peptides containing cysteinyl-S-β-propionamide, and methylated glutamic or aspartic acid residues can partially, but not wholly, account for these false positive methyl-PSMs. Together, these results indicate that the target-decoy approach is an unreliable means of estimating methyl-PSM FDRs and methyl-PSM filtering criteria. We suggest that orthogonal methylpeptide validation (e.g. heavy-methyl SILAC or its offshoots) should be considered a prerequisite for obtaining high confidence methyl-PSMs in large scale LC-MS/MS methylation site discovery experiments and make recommendations on how to reduce methyl-PSM FDRs in samples not amenable to heavy isotope labeling. Data are available via ProteomeXchange with the data identifier PXD002857.
- Research Article
155
- 10.1074/mcp.r500004-mcp200
- Jan 31, 2005
- Molecular & Cellular Proteomics
Pancreatic cancer is a uniformly lethal disease that is difficult to diagnose at early stage and even more difficult to cure. In recent years, there has been a substantial interest in applying proteomics technologies to identify protein biomarkers for early detection of cancer. Quantitative proteomic profiling of body fluids, tissues, or other biological samples to identify differentially expressed proteins represents a very promising approach for improving the outcome of this disease. Proteins associated with pancreatic cancer identified through proteomic profiling technologies could be useful as biomarkers for the early diagnosis, therapeutic targets, and disease response markers. In this article, we discuss recent progress and challenges for applying quantitative proteomics technologies for biomarker discovery in pancreatic cancer.