Protein Inference Research Articles

Abstract Although mass spectrometry is powerful for proteomic quantification, the inherent issue of missing values remains a significant challenge. We found a prevalent lack of quantification of valuable proteins (e.g., immune cell markers and drug targets) in published datasets, limiting the functional utilities of quantitative proteomic profiles. This was exemplified by the substantial lack of quantification of immune cell marker, which impede the dissection of tissue-infiltrating immune cells by deconvolution analysis of cancer proteomic profiles. Thus, we introduce a network-based method NUWA-ms for robust abundance inference of missing proteins using multi-cohort proteomic profiles. By evaluating 561 tumors with paired proteomic and transcriptomic profiles, a significant improvement of deconvolution performance was shown with the aid of NUWA-ms. This was further validated by the comparison between scRNA-seq and proteomic analyses of a same gastric cancer tumor tissue. NUWA-ms applications to cancer proteomic profiles facilitated the inference of CD8+ T cell markers and effector proteins, enabling the associations of CD8+ T cell infiltration with MSI status in colorectal cancer and anti-PD-1 therapy response in melanoma. These findings demonstrated the utilities of NUWA-ms in depicting infiltrating lymphocytes in the tumor microenvironment using proteomic profiles. Furthermore, based on the proteomic profiles of six patient-derived tumor organoids (PDOs), NUWA-ms application helped identifying PDOs sensitive to a targeted therapeutic agent including the ones neglected by the raw MS quantification, and aided in revealing the resistant mechanism. Together, NUWA-ms could enable a robust inference of protein abundance, to help identifying protein biomarkers and therapeutic targets for cancer and other complex diseases. Citation Format: Lihua Cao, Yuhao Xie, Jiale Chen, Man Wang, Tingting Zhao, Heli Yang, Yang Du, Yang Yang, Zhaode Bu, Jiafu Ji, Jianmin Wu. NUWA-ms: A network-based method to infer quantification of missing proteins using multi-cohort proteomics profiles [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2024; Part 1 (Regular Abstracts); 2024 Apr 5-10; San Diego, CA. Philadelphia (PA): AACR; Cancer Res 2024;84(6_Suppl):Abstract nr 2325.

Read full abstract

BackgroundThe high diversity and complexity of the microbial community make it a formidable challenge to identify and quantify the large number of proteins expressed in the community. Conventional metaproteomics approaches largely rely on accurate identification of the MS/MS spectra to their corresponding short peptides in the digested samples, followed by protein inference and subsequent taxonomic and functional analysis of the detected proteins. These approaches are dependent on the availability of protein sequence databases derived either from sample-specific metagenomic data or from public repositories. Due to the incompleteness and imperfections of these protein sequence databases, and the preponderance of homologous proteins expressed by different bacterial species in the community, this computational process of peptide identification and protein inference is challenging and error-prone, which hinders the comparison of metaproteomes across multiple samples.ResultsWe developed metaSpectraST, an unsupervised and database-independent metaproteomics workflow, which quantitatively profiles and compares metaproteomics samples by clustering experimentally observed MS/MS spectra based on their spectral similarity. We applied metaSpectraST to fecal samples collected from littermates of two different mother mice right after weaning. Quantitative proteome profiles of the microbial communities of different mice were obtained without any peptide-spectrum identification and used to evaluate the overall similarity between samples and highlight any differentiating markers. Compared to the conventional database-dependent metaproteomics analysis, metaSpectraST is more successful in classifying the samples and detecting the subtle microbiome changes of mouse gut microbiomes post-weaning. metaSpectraST could also be used as a tool to select the suitable biological replicates from samples with wide inter-individual variation.ConclusionsmetaSpectraST enables rapid profiling of metaproteomic samples quantitatively, without the need for constructing the protein sequence database or identification of the MS/MS spectra. It maximally preserves information contained in the experimental MS/MS spectra by clustering all of them first and thus is able to better profile the complex microbial communities and highlight their functional changes, as compared with conventional approaches. tag the videobyte in this section as ESM45SRGQ1P3pEwLWJemohrVpiVideo

Read full abstract

Protein Inference Research Articles

Related Topics

Articles published on Protein Inference

GraphPI: Efficient Protein Inference with Graph Neural Networks.

Proteome-wide copy-number estimation from transcriptomics

PowerNovo: de novo peptide sequencing via tandem mass spectrometry using an ensemble of transformer and BERT models

MS-PyCloud: A Cloud Computing-Based Pipeline for Proteomic and Glycoproteomic Data Analyses.

Abstract 2325: NUWA-ms: A network-based method to infer quantification of missing proteins using multi-cohort proteomics profiles

AI-Assisted Processing Pipeline to Boost Protein Isoform Detection.

WOMBAT-P: Benchmarking Label-Free Proteomics Data Analysis Workflows.

Leveraging genomic redundancy to improve inference and alignment of orthologous proteins.

MetaSpectraST: an unsupervised and database-independent analysis workflow for metaproteomic MS/MS data using spectrum clustering

ProInfer: An interpretable protein inference tool leveraging on biological networks.

Trans-Proteomic Pipeline: Robust Mass Spectrometry-Based Proteomics Data Analysis Suite.

Overview and considerations in bottom-up proteomics.

How Often Does Filtering of Alignment Columns Improve the Phylogenetic Inference of Two-Domain Proteins?

A Common Target of Nitrite and Nitric Oxide for Respiration Inhibition in Bacteria.

MetaLP: An integrative linear programming method for protein inference in metaproteomics.

Characterization of peptide-protein relationships in protein ambiguity groups via bipartite graphs.

Earliest Photic Zone Niches Probed by Ancestral Microbial Rhodopsins.

VIQoR: a web service for visually supervised protein inference and protein quantification.

Enhanced protein isoform characterization through long-read proteogenomics

Putting Humpty Dumpty Back Together Again: What Does Protein Quantification Mean in Bottom-Up Proteomics?

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Protein Inference Research Articles

Related Topics

Articles published on Protein Inference

GraphPI: Efficient Protein Inference with Graph Neural Networks.

Proteome-wide copy-number estimation from transcriptomics

PowerNovo: de novo peptide sequencing via tandem mass spectrometry using an ensemble of transformer and BERT models

MS-PyCloud: A Cloud Computing-Based Pipeline for Proteomic and Glycoproteomic Data Analyses.

Abstract 2325: NUWA-ms: A network-based method to infer quantification of missing proteins using multi-cohort proteomics profiles

AI-Assisted Processing Pipeline to Boost Protein Isoform Detection.

WOMBAT-P: Benchmarking Label-Free Proteomics Data Analysis Workflows.

Leveraging genomic redundancy to improve inference and alignment of orthologous proteins.

MetaSpectraST: an unsupervised and database-independent analysis workflow for metaproteomic MS/MS data using spectrum clustering

ProInfer: An interpretable protein inference tool leveraging on biological networks.

Trans-Proteomic Pipeline: Robust Mass Spectrometry-Based Proteomics Data Analysis Suite.

Overview and considerations in bottom-up proteomics.

How Often Does Filtering of Alignment Columns Improve the Phylogenetic Inference of Two-Domain Proteins?

A Common Target of Nitrite and Nitric Oxide for Respiration Inhibition in Bacteria.

MetaLP: An integrative linear programming method for protein inference in metaproteomics.

Characterization of peptide-protein relationships in protein ambiguity groups via bipartite graphs.

Earliest Photic Zone Niches Probed by Ancestral Microbial Rhodopsins.

VIQoR: a web service for visually supervised protein inference and protein quantification.

Enhanced protein isoform characterization through long-read proteogenomics

Putting Humpty Dumpty Back Together Again: What Does Protein Quantification Mean in Bottom-Up Proteomics?