Imputation of label-free quantitative mass spectrometry-based proteomics data using self-supervised deep learning

Henry Webel,Henry Webel,Lili Niu,Annelaura Bach Nielsen,Marie Locard-Paulet,Marie Locard-Paulet,Matthias Mann,Matthias Mann,Lars Juhl Jensen,Simon Rasmussen,Simon Rasmussen,Simon Rasmussen

doi:10.1038/s41467-024-48711-5

Abstract

Imputation techniques provide means to replace missing measurements with a value and are used in almost all downstream analysis of mass spectrometry (MS) based proteomics data using label-free quantification (LFQ). Here we demonstrate how collaborative filtering, denoising autoencoders, and variational autoencoders can impute missing values in the context of LFQ at different levels. We applied our method, proteomics imputation modeling mass spectrometry (PIMMS), to an alcohol-related liver disease (ALD) cohort with blood plasma proteomics data available for 358 individuals. Removing 20 percent of the intensities we were able to recover 15 out of 17 significant abundant protein groups using PIMMS-VAE imputations. When analyzing the full dataset we identified 30 additional proteins (+13.2%) that were significantly differentially abundant across disease stages compared to no imputation and found that some of these were predictive of ALD progression in machine learning models. We, therefore, suggest the use of deep learning approaches for imputing missing values in MS-based proteomics on larger datasets and provide workflows for these.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Nature Communications	Publication Date: Jun 26, 2024
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Imputation of label-free quantitative mass spectrometry-based proteomics data using self-supervised deep learning

Abstract

Talk to us

Similar Papers

More From: Nature Communications

Lead the way for us

Similar Papers

Molecular fingerprinting of the podocyte reveals novel gene and protein regulatory networks
Melanie Boerries ... Tobias B Huber
Kidney International | VOL. 83
Melanie Boerries, et. al.Melanie Boerries ... Tobias B Huber
01 Jun 2013
Kidney International | VOL. 83

SelfMin: Self-Supervised Deep Learning for Advanced Mineralogical Analysis
Ardiansyah Koeshidayatullah ... Ivan Ferreira
-
Ardiansyah Koeshidayatullah, et. al.Ardiansyah Koeshidayatullah ... Ivan Ferreira
15 May 2023
15 May 2023

Pluto: A global volcanic activity early warning system powered by large scale self-supervised deep learning on InSAR data
Nikolaos Ioannis Bountos ... Ioannis Papoutsis
-
Nikolaos Ioannis Bountos, et. al.Nikolaos Ioannis Bountos ... Ioannis Papoutsis
15 May 2023
15 May 2023

Self-supervised learning with large-scale web image mining for characterizing glomerular lesions
Tianyuan Yao ... Richard M Levenson
-
Tianyuan Yao, et. al.Tianyuan Yao ... Richard M Levenson
04 Apr 2022
04 Apr 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Imputation of label-free quantitative mass spectrometry-based proteomics data using self-supervised deep learning

Abstract

Talk to us

Similar Papers

More From: Nature Communications