Data Imputation in Merged Isobaric Labeling-Based Relative Quantification Datasets.

Nicolai Bjødstrup Palstrøm, Hans Christian Beck, Rune Matthiesen

doi:10.1007/978-1-4939-9744-2_13

Abstract

Data-dependent acquisition in mass spectrometry-based proteomics, combined with quantitative analysis using isobaric labeling (iTRAQ and TMT), inevitably introduces missing values when several LC runs are merged into one dataset. This is especially the case in the growing field of shotgun clinical proteomics, where protein profiles from several hundred patient samples are compared and correlated with clinical traits, such as a specific disease or disease treatment, in order to link specific outcomes to one or more proteins. In clinical research, missing values in such datasets reduce the power of the downstream statistical analysis and may therefore hamper linking disease traits to the expression of specific proteins that could be useful for prognostic, diagnostic, or predictive purposes. In our study, we tested three data imputation approaches, initially developed for microarray data, on datasets generated by several runs of shotgun proteomic experiments in which the data were relative protein abundances based on isobaric tags (iTRAQ and TMT). Our conclusion is that imputation methods based on k-nearest neighbors successfully impute missing values in datasets with up to 50% missing values.
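The abstract does not spell out how the k-nearest-neighbor imputation was implemented, so the following is only a minimal illustrative sketch of the general idea, not the authors' method. It assumes a matrix with proteins as rows and samples as columns, `None` marking a missing ratio, a root-mean-square distance over the columns two proteins share, and an unweighted mean of the neighbors' values. The function name `knn_impute`, the choice `k=2`, and the toy values are all hypothetical.

```python
import math


def knn_impute(matrix, k=2):
    """Return a copy of matrix with None entries filled from the k most similar rows.

    Assumption: rows are proteins, columns are samples, None marks a missing value.
    """
    imputed = [row[:] for row in matrix]
    for i, row in enumerate(matrix):
        for j, value in enumerate(row):
            if value is not None:
                continue
            candidates = []
            for m, other in enumerate(matrix):
                # A neighbor must be a different row with an observed value in column j.
                if m == i or other[j] is None:
                    continue
                shared = [(a, b) for a, b in zip(row, other)
                          if a is not None and b is not None]
                if not shared:
                    continue
                # Root-mean-square difference over the columns both rows have observed.
                dist = math.sqrt(sum((a - b) ** 2 for a, b in shared) / len(shared))
                candidates.append((dist, other[j]))
            if candidates:
                candidates.sort(key=lambda c: c[0])
                nearest = candidates[:k]
                imputed[i][j] = sum(v for _, v in nearest) / len(nearest)
            # With no usable neighbor the entry is left as None.
    return imputed


# Toy relative abundances (invented for illustration only).
ratios = [
    [1.0, 2.0, None],
    [1.0, 2.0, 3.0],
    [1.1, 2.1, 3.2],
    [5.0, 6.0, 9.0],
]
filled = knn_impute(ratios, k=2)
print(round(filled[0][2], 3))  # mean of the two closest rows' values, about 3.1
```

Distances are computed from the original matrix rather than from already-imputed values, so the order in which rows are processed does not affect the result. Established implementations, such as those developed for microarray data, differ in details like neighbor weighting and how rows with many missing values are handled.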

Journal: Methods in molecular biology (Clifton, N.J.)
Publication Date: Sep 25, 2019
Citations: 10

Similar Papers

Multiply imputing missing values in data sets with mixed measurement scales using a sequence of generalised linear models
Min Cherng Lee ... Robin Mitra
Computational Statistics & Data Analysis | VOL. 95
09 Sep 2015

A Hybrid Approach for Missing Data Imputation in Gene Expression Dataset Using Extra Tree Regressor and a Genetic Algorithm
Amarjeet Yadav ... Akhtar Rasool
01 Jan 2023

Handling Missing Values in Chronic Kidney Disease Datasets Using KNN, K-Means and K-Medoids Algorithms
Tahira Mahboob ... Amber Shahzad
01 Dec 2018

Analysis and Visualization of Missing Value Patterns
Bas Van Stein ... Wojtek Kowalczyk
01 Jan 2015
